Convolutional neural network for Arabic speech recognition

Other Title(s)

استخدام الشبكة العصبية التلافيفية لتصميم نظام للتعرف علي الأصوات العربي

Joint Authors

Faruq, Hisham Muhammad
Radi, Inji Rajai
Hasan, Nabilah Muhammad
Hasan, Arafah Sabri

Source

The Egyptian Journal of Language Engineering

Issue

Vol. 8, Issue 1 (30 Apr. 2021), pp.27-38, 12 p.

Publisher

Egyptian Society of Language Engineering

Publication Date

2021-04-30

Country of Publication

Egypt

No. of Pages

12

Main Subjects

Information Technology and Computer Science

Topics

Abstract EN

This work is focused on single word Arabic automatic speech recognition (AASR).

Two techniques are used during the feature extraction phase; Log frequency spectral coefficients (MFSC) and Gammatone-frequency cepstral coefficients (GFCC) with their first and second-order derivatives.

The convolutional neural network (CNN) is mainly used to execute feature learning and classification process.

CNN achieved performance enhancement in automatic speech recognition (ASR).

Local connectivity, weight sharing, and pooling are the crucial properties of CNNs that have the potential to improve ASR.

We tested the CNN model using an Arabic speech corpus of isolated words.

The used corpus is synthetically augmented by applying different transformations such as changing the pitch, the speed, the dynamic range, adding noise, and forward and backward shift in time.

It was found that the maximum accuracy obtained when using GFCC with CNN is 99.77 % .

The outcome results of this work are compared to previous reports and indicate that CNN achieved better performance in AASR.

American Psychological Association (APA)

Radi, Inji Rajai& Faruq, Hisham Muhammad& Hasan, Nabilah Muhammad& Hasan, Arafah Sabri. 2021. Convolutional neural network for Arabic speech recognition. The Egyptian Journal of Language Engineering،Vol. 8, no. 1, pp.27-38.
https://search.emarefa.net/detail/BIM-1254893

Modern Language Association (MLA)

Radi, Inji Rajai…[et al.]. Convolutional neural network for Arabic speech recognition. The Egyptian Journal of Language Engineering Vol. 8, no. 1 (Apr. 2021), pp.27-38.
https://search.emarefa.net/detail/BIM-1254893

American Medical Association (AMA)

Radi, Inji Rajai& Faruq, Hisham Muhammad& Hasan, Nabilah Muhammad& Hasan, Arafah Sabri. Convolutional neural network for Arabic speech recognition. The Egyptian Journal of Language Engineering. 2021. Vol. 8, no. 1, pp.27-38.
https://search.emarefa.net/detail/BIM-1254893

Data Type

Journal Articles

Language

English

Notes

-

Record ID

BIM-1254893