Title
RS-CAE-Based AR-Wiener Filtering and Harmonic Recovery for Speech Enhancement.
Abstract
By taking into account temporal correlation of speech feature. In this paper, a novel structure of convolutional Auto Encoder CAE was proposed. In this structure, the historical output of the CAE was fed into a CAE stack recurrently. We name this structure as Recurrent Stack Convolutional Auto Encoder RS-CAE. In the training stage, the training feature maps of the RS-CAE comprise of log power spectrum LPS of noisy speech and an additional feature map derived from the LPS of the enhanced speech in the history. In this way, the temporal correlation is incorporated as much as possible in the RS-CAE. The training target is a concatenated vector of auto-regressive AR model parameters of speech and noise. At online stage, the LPS of noisy speech and the LPS of the enhanced speech from the history make up input feature maps together. The outputs of the RS-CAE are the AR model parameters of speech and noise, which are used to construct the AR-Wiener filter. Because the estimated AR model parameters are not completely accurate and some harmonics may be lost in the enhanced speech, the codebook-based harmonic recovery technique was proposed to reconstruct harmonic structure of the enhanced speech. The test results confirmed that the proposed method achieved better performance compared with some existing approaches.
Year
DOI
Venue
2019
10.1109/TASLP.2019.2930914
IEEE/ACM Trans. Audio, Speech & Language Processing
Keywords
Field
DocType
Speech enhancement,Training,Noise measurement,Harmonic analysis,Convolutional codes,Feature extraction,Power harmonic filters
Wiener filter,Speech enhancement,Autoregressive model,Autoencoder,Pattern recognition,Computer science,Harmonic,Speech recognition,Spectral density,Harmonics,Artificial intelligence,Codebook
Journal
Volume
Issue
ISSN
27
11
2329-9290
Citations 
PageRank 
References 
0
0.34
11
Authors
2
Name
Order
Citations
PageRank
Yan Yang112122.47
Changchun Bao213346.05