Audio-Textual Emotion Recognition Based on Improved Neural Networks

المؤلفون المشاركون

Zhou, Sitong
Hu, Yaxin
Dong, Jiangong
Cai, Linqin

المصدر

Mathematical Problems in Engineering

العدد

المجلد 2019، العدد 2019 (31 ديسمبر/كانون الأول 2019)، ص ص. 1-9، 9ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2019-12-31

دولة النشر

مصر

عدد الصفحات

9

التخصصات الرئيسية

هندسة مدنية

الملخص EN

With the rapid development in social media, single-modal emotion recognition is hard to satisfy the demands of the current emotional recognition system.

Aiming to optimize the performance of the emotional recognition system, a multimodal emotion recognition model from speech and text was proposed in this paper.

Considering the complementarity between different modes, CNN (convolutional neural network) and LSTM (long short-term memory) were combined in a form of binary channels to learn acoustic emotion features; meanwhile, an effective Bi-LSTM (bidirectional long short-term memory) network was resorted to capture the textual features.

Furthermore, we applied a deep neural network to learn and classify the fusion features.

The final emotional state was determined by the output of both speech and text emotion analysis.

Finally, the multimodal fusion experiments were carried out to validate the proposed model on the IEMOCAP database.

In comparison with the single modal, the overall recognition accuracy of text increased 6.70%, and that of speech emotion recognition soared 13.85%.

Experimental results show that the recognition accuracy of our multimodal is higher than that of the single modal and outperforms other published multimodal models on the test datasets.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Cai, Linqin& Hu, Yaxin& Dong, Jiangong& Zhou, Sitong. 2019. Audio-Textual Emotion Recognition Based on Improved Neural Networks. Mathematical Problems in Engineering،Vol. 2019, no. 2019, pp.1-9.
https://search.emarefa.net/detail/BIM-1194838

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Cai, Linqin…[et al.]. Audio-Textual Emotion Recognition Based on Improved Neural Networks. Mathematical Problems in Engineering No. 2019 (2019), pp.1-9.
https://search.emarefa.net/detail/BIM-1194838

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Cai, Linqin& Hu, Yaxin& Dong, Jiangong& Zhou, Sitong. Audio-Textual Emotion Recognition Based on Improved Neural Networks. Mathematical Problems in Engineering. 2019. Vol. 2019, no. 2019, pp.1-9.
https://search.emarefa.net/detail/BIM-1194838

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1194838