Deep Learning-Based Amplitude Fusion for Speech Dereverberation
Joint Authors
Liu, Chunlei
Wang, Longbiao
Dang, Jianwu
Source
Discrete Dynamics in Nature and Society
Issue
Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-14, 14 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2020-07-14
Country of Publication
Egypt
No. of Pages
14
Main Subjects
Abstract EN
Mapping and masking are two important speech enhancement methods based on deep learning that aim to recover the original clean speech from corrupted speech.
In practice, too large recovery errors severely restrict the improvement in speech quality.
In our preliminary experiment, we demonstrated that mapping and masking methods had different conversion mechanisms and thus assumed that their recovery errors are highly likely to be complementary.
Also, the complementarity was validated accordingly.
Based on the principle of error minimization, we propose the fusion between mapping and masking for speech dereverberation.
Specifically, we take the weighted mean of the amplitudes recovered by the two methods as the estimated amplitude of the fusion method.
Experiments verify that the recovery error of the fusion method is further controlled.
Compared with the existing geometric mean method, the weighted mean method we proposed has achieved better results.
Speech dereverberation experiments manifest that the weighted mean method improves PESQ and SNR by 5.8% and 25.0%, respectively, compared with the traditional masking method.
American Psychological Association (APA)
Liu, Chunlei& Wang, Longbiao& Dang, Jianwu. 2020. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society،Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1153074
Modern Language Association (MLA)
Liu, Chunlei…[et al.]. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society No. 2020 (2020), pp.1-14.
https://search.emarefa.net/detail/BIM-1153074
American Medical Association (AMA)
Liu, Chunlei& Wang, Longbiao& Dang, Jianwu. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society. 2020. Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1153074
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1153074