Comparative study of different types of RNN in speech classification

Other titles

A comparative study of different types of recurrent neural networks in speech classification

Co-authors

Said, Tariq M.
Judi, Amr Muhammad Rifat
Hamid, Ayat Nabil

Source

The Egyptian Journal of Language Engineering

Issue

Vol. 8, no. 1 (30 April 2021), pp. 1-16, 16 p.

Publisher

Egyptian Society of Language Engineering

Publication date

2021-04-30

Country of publication

Egypt

Number of pages

16

Main subjects

Electrical Engineering

Topics

Abstract EN

This paper presents several models for a pre-processing classification stage and evaluates their performance in an Automatic Speech Recognition (ASR) system.

Several Recurrent Neural Network (RNN) architectures have been tested for this problem, including simple RNN cells (RNN), bidirectional RNN (BRNN), Long Short-Term Memory (LSTM), and bidirectional LSTM (BLSTM).
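As a rough illustration of what "bidirectional" means here, the sketch below runs a simple tanh RNN over a frame sequence in both directions and concatenates the two hidden states per frame. It is not the authors' implementation: the weights are random, the helper `random_params` is hypothetical, and the hidden size of 30 is only borrowed from the numbers in the model names, on the assumption that they denote layer widths.

```python
import math
import random

def rnn_pass(frames, w_in, w_rec, bias):
    """Run a simple (Elman) RNN over frames; return one hidden vector per frame."""
    hidden_size = len(bias)
    h = [0.0] * hidden_size
    outputs = []
    for x in frames:
        # New state depends on the current frame and the previous state.
        h = [
            math.tanh(
                sum(w_in[j][i] * x[i] for i in range(len(x)))
                + sum(w_rec[j][k] * h[k] for k in range(hidden_size))
                + bias[j]
            )
            for j in range(hidden_size)
        ]
        outputs.append(h)
    return outputs

def birnn_pass(frames, fwd_params, bwd_params):
    """Concatenate a left-to-right pass with a right-to-left pass, frame by frame."""
    fwd = rnn_pass(frames, *fwd_params)
    bwd = rnn_pass(frames[::-1], *bwd_params)[::-1]  # re-align to original order
    return [f + b for f, b in zip(fwd, bwd)]

def random_params(input_size, hidden_size, rng):
    """Hypothetical helper: small random weights, for illustration only."""
    w_in = [[rng.uniform(-0.1, 0.1) for _ in range(input_size)]
            for _ in range(hidden_size)]
    w_rec = [[rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
             for _ in range(hidden_size)]
    return w_in, w_rec, [0.0] * hidden_size

rng = random.Random(0)
# Six made-up frames of 39 features each (assumed feature size).
frames = [[rng.gauss(0.0, 1.0) for _ in range(39)] for _ in range(6)]
states = birnn_pass(frames, random_params(39, 30, rng), random_params(39, 30, rng))
print(len(states), len(states[0]))  # 6 frames, 60 values per frame (30 + 30)
```

Each frame's output therefore sees both past and future context, which is the property that separates BRNN and BLSTM from their unidirectional counterparts.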

Two main feature sets have been considered.

First, Mel-frequency cepstral coefficients (MFCC) plus delta and delta-delta coefficients (39 parameters) have been used.
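A minimal sketch of how a 39-dimensional vector can be assembled, assuming 13 base MFCCs per frame (the abstract gives only the total of 39) and the common regression formula for deltas with a window of 2 frames. The function names and the input data are made up for illustration.

```python
def delta(features, window=2):
    """Regression-based delta over a list of per-frame coefficient lists.

    Edge frames are handled by repeating the first and last frame.
    """
    n_frames = len(features)
    denom = 2 * sum(n * n for n in range(1, window + 1))
    out = []
    for t in range(n_frames):
        row = []
        for c in range(len(features[t])):
            acc = 0.0
            for n in range(1, window + 1):
                ahead = features[min(t + n, n_frames - 1)][c]
                behind = features[max(t - n, 0)][c]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out

def stack_with_deltas(mfcc):
    """Append delta and delta-delta coefficients to each frame."""
    d1 = delta(mfcc)
    d2 = delta(d1)
    return [m + a + b for m, a, b in zip(mfcc, d1, d2)]

# Made-up input: 5 frames of 13 coefficients that grow by 1 per frame.
mfcc = [[float(t + c) for c in range(13)] for t in range(5)]
feats = stack_with_deltas(mfcc)
print(len(feats), len(feats[0]))  # 5 frames, 39 values per frame
print(feats[2][13])               # delta of a unit ramp at an interior frame: 1.0
```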

Second, MFCCs quantized with a Vector Quantization technique have been used as features.
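The abstract does not say which Vector Quantization variant or codebook size was used. As one common choice, the sketch below trains a codebook with plain k-means and replaces each feature vector by the index of its nearest codeword. The function names, the evenly spaced initialization, and the two-dimensional toy data are all assumptions made for illustration.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def quantize(vector, codebook):
    """Return the index of the nearest codeword."""
    return min(range(len(codebook)), key=lambda k: sq_dist(vector, codebook[k]))

def train_codebook(vectors, size, iterations=10):
    """Plain k-means codebook training.

    Initial codewords are taken at evenly spaced positions in the
    training data, which is a simplification chosen for determinism.
    """
    codebook = [list(vectors[i * len(vectors) // size]) for i in range(size)]
    for _ in range(iterations):
        cells = [[] for _ in range(size)]
        for v in vectors:
            cells[quantize(v, codebook)].append(v)
        for k, cell in enumerate(cells):
            if cell:  # keep the old codeword if its cell is empty
                codebook[k] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook

rng = random.Random(1)
# Two well-separated made-up clusters standing in for MFCC vectors.
vectors = (
    [[rng.gauss(0.0, 0.1), rng.gauss(0.0, 0.1)] for _ in range(20)]
    + [[rng.gauss(5.0, 0.1), rng.gauss(5.0, 0.1)] for _ in range(20)]
)
codebook = train_codebook(vectors, size=2)
indices = [quantize(v, codebook) for v in vectors]
print(indices[:3], indices[-3:])  # [0, 0, 0] [1, 1, 1]
```

With this representation each frame is reduced to a single codeword index, at the cost of discarding the fine detail of the original coefficients.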

All models have been trained on the TIMIT database.

Vowels, nasals, fricatives, plosives, and silences have been chosen as syllable classes for classification.
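The abstract does not list which TIMIT labels fall into each class. The sketch below shows one plausible way to collapse phone labels into the five classes, using only a small subset of TIMIT phone symbols. The grouping itself is an assumption, and labels outside the subset are returned as `None` rather than guessed.

```python
# Illustrative subset only; the paper's actual grouping is not given in the abstract.
BROAD_CLASS = {
    "vowel": ["iy", "ih", "eh", "ae", "aa", "ah", "uw"],
    "nasal": ["m", "n", "ng"],
    "fricative": ["s", "sh", "z", "f", "v", "th"],
    "plosive": ["p", "t", "k", "b", "d", "g"],
    "silence": ["h#", "pau"],
}

# Invert the table so each phone label points at its class.
PHONE_TO_CLASS = {
    phone: name for name, phones in BROAD_CLASS.items() for phone in phones
}

def to_broad_classes(phones):
    """Map a phone-label sequence to class names (None if not in the subset)."""
    return [PHONE_TO_CLASS.get(p) for p in phones]

example = ["h#", "sh", "iy", "n", "t", "h#"]
print(to_broad_classes(example))
# ['silence', 'fricative', 'vowel', 'nasal', 'plosive', 'silence']
```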

Experimental results show that the BRNN-MFCC-5-{30, 30, 20, 25, 25} and BLSTM-MFCC-4-{30, 30, 25, 20} systems, using MFCC plus delta and delta-delta coefficients (39 parameters), give the highest accuracy.

They achieved 92.6% and 92.07%, respectively.

Vowels, nasals, and silences reach their highest accuracy in the BLSTM-MFCC-4-{30, 30, 25, 20} model, with 98.5%, 83.6%, and 93.7%, respectively.

Fricatives and plosives reach their highest accuracy in the BRNN-MFCC-5-{30, 30, 20, 25, 25} model, with 89.7% and 66%, respectively.
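The abstract reports both overall and per-class accuracy without defining them. Assuming the usual definitions (correct predictions divided by the number of reference items, overall or within one class), they can be computed as in the sketch below. The label sequences are made up and are not results from the paper.

```python
from collections import Counter

CLASSES = ["vowel", "nasal", "fricative", "plosive", "silence"]

def per_class_accuracy(reference, predicted):
    """Fraction of correctly labeled items within each reference class."""
    total = Counter(reference)
    correct = Counter(r for r, p in zip(reference, predicted) if r == p)
    return {c: correct[c] / total[c] for c in CLASSES if total[c]}

def overall_accuracy(reference, predicted):
    """Fraction of correctly labeled items over all classes."""
    return sum(r == p for r, p in zip(reference, predicted)) / len(reference)

# Made-up labels for illustration only.
reference = ["vowel", "vowel", "nasal", "plosive",
             "plosive", "silence", "fricative", "vowel"]
predicted = ["vowel", "vowel", "vowel", "plosive",
             "fricative", "silence", "fricative", "vowel"]

print(overall_accuracy(reference, predicted))  # 0.75
print(per_class_accuracy(reference, predicted))
# {'vowel': 1.0, 'nasal': 0.0, 'fricative': 1.0, 'plosive': 0.5, 'silence': 1.0}
```

A model can score well overall while doing poorly on a rare class, which is why the per-class figures for plosives and nasals are worth reading alongside the headline accuracy.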

American Psychological Association (APA) citation style

Hamid, Ayat Nabil & Judi, Amr Muhammad Rifat & Said, Tariq M. 2021. Comparative study of different types of RNN in speech classification. The Egyptian Journal of Language Engineering, Vol. 8, no. 1, pp. 1-16.
https://search.emarefa.net/detail/BIM-1254891

Modern Language Association (MLA) citation style

Hamid, Ayat Nabil…[et al.]. Comparative study of different types of RNN in speech classification. The Egyptian Journal of Language Engineering Vol. 8, no. 1 (Apr. 2021), pp. 1-16.
https://search.emarefa.net/detail/BIM-1254891

American Medical Association (AMA) citation style

Hamid, Ayat Nabil & Judi, Amr Muhammad Rifat & Said, Tariq M. Comparative study of different types of RNN in speech classification. The Egyptian Journal of Language Engineering. 2021. Vol. 8, no. 1, pp. 1-16.
https://search.emarefa.net/detail/BIM-1254891

Data type

Articles

Text language

English

Notes

-

Record number

BIM-1254891