Deep Learning-Based Amplitude Fusion for Speech Dereverberation

Joint Authors

Liu, Chunlei
Wang, Longbiao
Dang, Jianwu

Source

Discrete Dynamics in Nature and Society

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-14, 14 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-07-14

Country of Publication

Egypt

No. of Pages

14

Main Subjects

Mathematics

Abstract EN

Mapping and masking are two important speech enhancement methods based on deep learning that aim to recover the original clean speech from corrupted speech.

In practice, too large recovery errors severely restrict the improvement in speech quality.

In our preliminary experiment, we demonstrated that mapping and masking methods had different conversion mechanisms and thus assumed that their recovery errors are highly likely to be complementary.

Also, the complementarity was validated accordingly.

Based on the principle of error minimization, we propose the fusion between mapping and masking for speech dereverberation.

Specifically, we take the weighted mean of the amplitudes recovered by the two methods as the estimated amplitude of the fusion method.

Experiments verify that the recovery error of the fusion method is further controlled.

Compared with the existing geometric mean method, the weighted mean method we proposed has achieved better results.

Speech dereverberation experiments manifest that the weighted mean method improves PESQ and SNR by 5.8% and 25.0%, respectively, compared with the traditional masking method.

American Psychological Association (APA)

Liu, Chunlei& Wang, Longbiao& Dang, Jianwu. 2020. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society،Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1153074

Modern Language Association (MLA)

Liu, Chunlei…[et al.]. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society No. 2020 (2020), pp.1-14.
https://search.emarefa.net/detail/BIM-1153074

American Medical Association (AMA)

Liu, Chunlei& Wang, Longbiao& Dang, Jianwu. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society. 2020. Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1153074

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1153074