Convolutional neural network for Arabic speech recognition
Other Title(s)
استخدام الشبكة العصبية التلافيفية لتصميم نظام للتعرف علي الأصوات العربي
Joint Authors
Faruq, Hisham Muhammad
Radi, Inji Rajai
Hasan, Nabilah Muhammad
Hasan, Arafah Sabri
Source
The Egyptian Journal of Language Engineering
Issue
Vol. 8, Issue 1 (30 Apr. 2021), pp.27-38, 12 p.
Publisher
Egyptian Society of Language Engineering
Publication Date
2021-04-30
Country of Publication
Egypt
No. of Pages
12
Main Subjects
Information Technology and Computer Science
Topics
Abstract EN
This work is focused on single word Arabic automatic speech recognition (AASR).
Two techniques are used during the feature extraction phase; Log frequency spectral coefficients (MFSC) and Gammatone-frequency cepstral coefficients (GFCC) with their first and second-order derivatives.
The convolutional neural network (CNN) is mainly used to execute feature learning and classification process.
CNN achieved performance enhancement in automatic speech recognition (ASR).
Local connectivity, weight sharing, and pooling are the crucial properties of CNNs that have the potential to improve ASR.
We tested the CNN model using an Arabic speech corpus of isolated words.
The used corpus is synthetically augmented by applying different transformations such as changing the pitch, the speed, the dynamic range, adding noise, and forward and backward shift in time.
It was found that the maximum accuracy obtained when using GFCC with CNN is 99.77 % .
The outcome results of this work are compared to previous reports and indicate that CNN achieved better performance in AASR.
American Psychological Association (APA)
Radi, Inji Rajai& Faruq, Hisham Muhammad& Hasan, Nabilah Muhammad& Hasan, Arafah Sabri. 2021. Convolutional neural network for Arabic speech recognition. The Egyptian Journal of Language Engineering،Vol. 8, no. 1, pp.27-38.
https://search.emarefa.net/detail/BIM-1254893
Modern Language Association (MLA)
Radi, Inji Rajai…[et al.]. Convolutional neural network for Arabic speech recognition. The Egyptian Journal of Language Engineering Vol. 8, no. 1 (Apr. 2021), pp.27-38.
https://search.emarefa.net/detail/BIM-1254893
American Medical Association (AMA)
Radi, Inji Rajai& Faruq, Hisham Muhammad& Hasan, Nabilah Muhammad& Hasan, Arafah Sabri. Convolutional neural network for Arabic speech recognition. The Egyptian Journal of Language Engineering. 2021. Vol. 8, no. 1, pp.27-38.
https://search.emarefa.net/detail/BIM-1254893
Data Type
Journal Articles
Language
English
Notes
-
Record ID
BIM-1254893