In the process of signal acquisition by laser microphones, the high frequency components of speech are missing due to nonadditive distortion. In this paper, we proposed an end-to-end speech bandwidth extension (BWE) approach to recover narrow-band speech acquired by laser microphones. Our preliminary research showed that speech enhancement algorithms based on log-magnitude spectrogram in the frequency domain could not achieve satisfactory performance for this task. Therefore, we designed a speech BWE model in time domain, this model was modified by Wave-U-Net structure, we introduced the time convolution module (TCM), the dilation of convolution is helpful to increase receptive field, improves speech long-range correlation, at the same time introduced the multi-resolution loss function (LMSTFT) instead of the mean square error (MSE), the time-domain Wave-U-Net method avoided the decoupling of magnitude and phase in the frequency domain. The results showed that the signal-to-noise ratio (SNR) of speech was improved significantly compared with approach in the frequency domain, and obtained elaborate high-frequency components than frequency-domain convolutional recurrent network (CRN). We chose laser speech to test the model in an actual scene, which further verifies the practicability of the structure through the speech spectrum analysis, and had better performance and generalization ability than the original Wave-U-Net model.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.