The use of hybrid lvq3 neural networks for speaker recognition problems

Publication Date

2010-12-31

Country of Publication

Iraq

No. of Pages

Main Subjects

Engineering & Technology Sciences (Multidisciplinary)

Topics

Neural networks(Computer science)

Abstract AR

يهدف البحث إلى تطوير شبكة تعليم المتجه الكمي العصبية لحل مشاكل تمييز المتكلم أوتوماتيكيا.

يتكون النظام المقترح من مرحلتين أساسية هما مرحلتي التحليل و التمييز.

نفذت مرحلة التحليل باستخدام معاملات تحويل المويجة.

استخدمت الشبكات العصبية في مرحلة التمييز حيث تم بناء أربعة أنظمة مختلفة في مرحلة التمييز لتلائم نتائج المعاملات في مرحلة التحليل التي نفذت باستخدام محول المويجة الجيبي المتقطع.

نفذ النظام الأول باستخدام التطور الثالث لشبكة تعليم المتجه الكمي، أي بتعبير آخر تطبيق الشبكة العصبية الصرفة.

انحزت بقية الأنظمة المقترحة باستخدام دوال تنشيط مختلفة.

نفذ النظام الثاني بتجميع الدوال السيجماوية الثلاث في دالة واحدة.

نفذ النظام الثالث باستخدام دالة التانش أما النظام الرابع فقد نفذ باستخدام شبكة تحويل المويجة العصبية و وفقا للمشتقة الثانية لدالة هار.

تم تطبيق النظام على عشرة أشخاص (5 ذكور 5 إناث) و بأعمار متقاربة.

وضحت نتائج الاختبار معدلات نسب التمييز لخمسة تسجيلات للشخص لم يتم تدريبها مسبقا.

Abstract EN

The main objective of this work is to develop the learning vector quantization (LVQ) neural network for the speaker recognition problems.

The proposed methods consist of two main stages: the analysis stage and the recognition stage.

The analysis stage was achieved by using wavelet transform coefficients, The neural networks was used in the recognition stage.

The Four speaker recognition systems were differently constructed to appropriate the coefficients results in the stage of analysis which it was achieved by discrete wavelet packet transform (DWPT) coefficients.

The first system was implemented by using the third development of LVQ (LVQ3).

In other words it is pure neural network.

The rest of the proposed systems were implemented by using different proposed activation functions of this network.

The second system was implemented by combining the three sigmoidal functions in one function .The third system was implemented by a tansh function, and the last system was implemented using a wavenet network under which the second derivative of Haar wavelet function will be used.

The work was applied on 10 persons (5 males and 5 females) which have converge ages.

The results of the test showed that recognition rate averages for the five-recorded speaker statement which not previously trained.

American Psychological Association (APA)

al-Araji, Nabil Hashim Kaghid& al-Shamery, Iman S.. 2010. The use of hybrid lvq3 neural networks for speaker recognition problems. Journal of Babylon University : Journal of Applied and Pure Sciences،Vol. 18, no. 5, pp.1846-1851.
https://search.emarefa.net/detail/BIM-287359

Modern Language Association (MLA)

al-Araji, Nabil Hashim Kaghid& al-Shamery, Iman S.. The use of hybrid lvq3 neural networks for speaker recognition problems. Journal of Babylon University : Journal of Applied and Pure Sciences Vol. 18, no. 5 (2010 ), pp.1846-1851.
https://search.emarefa.net/detail/BIM-287359

American Medical Association (AMA)

al-Araji, Nabil Hashim Kaghid& al-Shamery, Iman S.. The use of hybrid lvq3 neural networks for speaker recognition problems. Journal of Babylon University : Journal of Applied and Pure Sciences. 2010. Vol. 18, no. 5, pp.1846-1851.
https://search.emarefa.net/detail/BIM-287359

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 1851

Record ID

BIM-287359

SaveSaved Print

Arab Citation & Impact Factor "Arcif"

Largest Arabic Database of Citations Analysis for the Arabic Scholarly Journals Issued in Arab World.

eMarefa Indicators
for Arab Scientific Production

"Kashif" for Checking Similarity or Plagiarism in the Arabic Researches. know more