Robust visual lips feature extraction method for improved visual speech recognition system

Joint Authors

Ali, Wisam H.
Muhammad, Mahmud H.
Said, Thamir R.

Source

Engineering and Technology Journal

Issue

Vol. 36, Issue 2A (28 Feb. 2018), pp.136-145, 10 p.

Publisher

University of Technology

Publication Date

2018-02-28

Country of Publication

Iraq

No. of Pages

10

Main Subjects

Information Technology and Computer Science

Topics

Abstract EN

Recently, automatic lips reading ALR acquired a significant interest among many researchers due to its adoption in many applications.

One such application is in speech recognition system in noisy environment, where visual cue that contain some integral information added to the audio signal, as well as the way that person merges audio-visual stimulus to identify utterance.

The unsolved part of this problem is the utterance classification using only the visual cues without the availability of acoustic signal of the talker's speech.

By taking into considerations a set of frames from recorded video for a person uttering a word; a robust image processing technique is used to isolate the lips region, then suitable features are extracted that represent the mouth shape variation during speech.

These features are used by the classification stage to identify the uttered word.

This paper is solve this problem by introducing a new segmentation technique to isolate the lips region together with a set of visual features base on the extracted lips boundary which able to perform lips reading with significant result.

A special laboratory is designed to collect the utterance of twenty six English letters from a multiple speakers which are adopted in this paper (UOTEletters corpus).

Moreover; two type of classifier (using Numeral Virtual generalization (NVG) RAM and K nearest neighborhood KNN) where adopted to identify the talker’s utterance.

The recognition performance for the input visual utterance when using NVG RAM is 94.679%, which is utilized for the first time in this work.

While; 92.628% when KNN is utilize.

American Psychological Association (APA)

Muhammad, Mahmud H.& Said, Thamir R.& Ali, Wisam H.. 2018. Robust visual lips feature extraction method for improved visual speech recognition system. Engineering and Technology Journal،Vol. 36, no. 2A, pp.136-145.
https://search.emarefa.net/detail/BIM-832185

Modern Language Association (MLA)

Muhammad, Mahmud H.…[et al.]. Robust visual lips feature extraction method for improved visual speech recognition system. Engineering and Technology Journal Vol. 36, no. 2A (2018), pp.136-145.
https://search.emarefa.net/detail/BIM-832185

American Medical Association (AMA)

Muhammad, Mahmud H.& Said, Thamir R.& Ali, Wisam H.. Robust visual lips feature extraction method for improved visual speech recognition system. Engineering and Technology Journal. 2018. Vol. 36, no. 2A, pp.136-145.
https://search.emarefa.net/detail/BIM-832185

Data Type

Journal Articles

Language

English

Notes

Includes appendix : p. 145

Record ID

BIM-832185