Speech Separation Using Convolutional Neural Network and Attention Mechanism

المؤلفون المشاركون

Yuan, Chun-Miao
Sun, Xue-Mei
Zhao, Hu

المصدر

Discrete Dynamics in Nature and Society

العدد

المجلد 2020، العدد 2020 (31 ديسمبر/كانون الأول 2020)، ص ص. 1-10، 10ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2020-07-25

دولة النشر

مصر

عدد الصفحات

10

التخصصات الرئيسية

الرياضيات

الملخص EN

Speech information is the most important means of human communication, and it is crucial to separate the target voice from the mixed sound signals.

This paper proposes a speech separation model based on convolutional neural networks and attention mechanism.

The magnitude spectrum of the mixed speech signals, as the input, has its high dimensionality.

By analyzing the characteristics of the convolutional neural network and attention mechanism, it can be found that the convolutional neural network can effectively extract low-dimensional features and mine the spatiotemporal structure information in the speech signals, and the attention mechanism can reduce the loss of sequence information.

The accuracy of speech separation can be improved effectively by combining two mechanisms.

Compared to the typical speech separation model DRNN-2 + discrim, this method achieves 0.27 dB GNSDR gain and 0.51 dB GSIR gain, which illustrates that the speech separation model proposed in this paper has achieved an ideal separation effect.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Yuan, Chun-Miao& Sun, Xue-Mei& Zhao, Hu. 2020. Speech Separation Using Convolutional Neural Network and Attention Mechanism. Discrete Dynamics in Nature and Society،Vol. 2020, no. 2020, pp.1-10.
https://search.emarefa.net/detail/BIM-1152871

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Yuan, Chun-Miao…[et al.]. Speech Separation Using Convolutional Neural Network and Attention Mechanism. Discrete Dynamics in Nature and Society No. 2020 (2020), pp.1-10.
https://search.emarefa.net/detail/BIM-1152871

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Yuan, Chun-Miao& Sun, Xue-Mei& Zhao, Hu. Speech Separation Using Convolutional Neural Network and Attention Mechanism. Discrete Dynamics in Nature and Society. 2020. Vol. 2020, no. 2020, pp.1-10.
https://search.emarefa.net/detail/BIM-1152871

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1152871