Law cost training of hmm-based speech recognition or synthesis system with low size Arabic labeled unsegmented noisy speech database

Joint Authors

Nazmi, T. M.
Barakat, M. S.
Jad Allah, M. E.
al-Arif, T.

Source

International Journal of Intelligent Computing and Information Sciences

Issue

Vol. 8, Issue 1 (31 Jan. 2008)13 p.

Publisher

Ain Shams University Faculty of Computer and Information Sciences

Publication Date

2008-01-31

Country of Publication

Egypt

No. of Pages

13

Main Subjects

Information Technology and Computer Science

Topics

Abstract EN

The first step in the process of developing speech recognition and formant or statistical parametric speech synthesis applications is preparation of speech training database.

The recognition and synthesis quality depend crucially on speech database size and fidelity.

But, preparing large speech training data for these applications is very costly in effort and time since its recorded waveform files should be segmented and labeled according to the class of labels used in the system.

This paper discusses a way to overcome the problem of training data starvation and the high cost of preparing it by introducing a method for training Hidden Markov Model (HMM)-based system with low size Arabic labeled unsegmented noisy speech database.

This method can be used for developing HMM-based recognition or HMM-based synthesis system in which speech waveform is generated from HMMs themselves.

Subjective evaluation of proposed Arabic speech HMM-based synthesis system shows that also increasing the length of context-dependent models makes contextual factors more prominent and improves the formant structure of the generated phonemes but will have the disadvantage of poor generalization and perceivable clicks in the synthesized speech.

American Psychological Association (APA)

Barakat, M. S.& Jad Allah, M. E.& Nazmi, T. M.& al-Arif, T.. 2008. Law cost training of hmm-based speech recognition or synthesis system with low size Arabic labeled unsegmented noisy speech database. International Journal of Intelligent Computing and Information Sciences،Vol. 8, no. 1.
https://search.emarefa.net/detail/BIM-284840

Modern Language Association (MLA)

Barakat, M. S.…[et al.]. Law cost training of hmm-based speech recognition or synthesis system with low size Arabic labeled unsegmented noisy speech database. International Journal of Intelligent Computing and Information Sciences Vol. 8, no. 1 (Jan. 2008).
https://search.emarefa.net/detail/BIM-284840

American Medical Association (AMA)

Barakat, M. S.& Jad Allah, M. E.& Nazmi, T. M.& al-Arif, T.. Law cost training of hmm-based speech recognition or synthesis system with low size Arabic labeled unsegmented noisy speech database. International Journal of Intelligent Computing and Information Sciences. 2008. Vol. 8, no. 1.
https://search.emarefa.net/detail/BIM-284840

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references.

Record ID

BIM-284840