Deep Learning-Based Amplitude Fusion for Speech Dereverberation

المؤلفون المشاركون

Liu, Chunlei
Wang, Longbiao
Dang, Jianwu

المصدر

Discrete Dynamics in Nature and Society

العدد

المجلد 2020، العدد 2020 (31 ديسمبر/كانون الأول 2020)، ص ص. 1-14، 14ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2020-07-14

دولة النشر

مصر

عدد الصفحات

14

التخصصات الرئيسية

الرياضيات

الملخص EN

Mapping and masking are two important speech enhancement methods based on deep learning that aim to recover the original clean speech from corrupted speech.

In practice, too large recovery errors severely restrict the improvement in speech quality.

In our preliminary experiment, we demonstrated that mapping and masking methods had different conversion mechanisms and thus assumed that their recovery errors are highly likely to be complementary.

Also, the complementarity was validated accordingly.

Based on the principle of error minimization, we propose the fusion between mapping and masking for speech dereverberation.

Specifically, we take the weighted mean of the amplitudes recovered by the two methods as the estimated amplitude of the fusion method.

Experiments verify that the recovery error of the fusion method is further controlled.

Compared with the existing geometric mean method, the weighted mean method we proposed has achieved better results.

Speech dereverberation experiments manifest that the weighted mean method improves PESQ and SNR by 5.8% and 25.0%, respectively, compared with the traditional masking method.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Liu, Chunlei& Wang, Longbiao& Dang, Jianwu. 2020. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society،Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1153074

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Liu, Chunlei…[et al.]. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society No. 2020 (2020), pp.1-14.
https://search.emarefa.net/detail/BIM-1153074

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Liu, Chunlei& Wang, Longbiao& Dang, Jianwu. Deep Learning-Based Amplitude Fusion for Speech Dereverberation. Discrete Dynamics in Nature and Society. 2020. Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1153074

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1153074