Speaker recognition using discrete wavelet transformation

العناوين الأخرى

تمييز المتحدث باستخدام محول الموجات المنفصل

الجامعة

جامعة آل البيت

الكلية

كلية الأمير الحسين بن عبد الله لتكنولوجيا المعلومات

القسم الأكاديمي

قسم علوم الحاسوب

دولة الجامعة

الأردن

الدرجة العلمية

ماجستير

تاريخ الدرجة العلمية

2010

الملخص الإنجليزي

The use of biometric information has been known widely for both person identification and security application, it is common knowledge that each person can be identified by the unique characteristics of one or more of person biometrics.

One of the biometric characteristics of that a person can be identified by his voice.

This research attempts to find the best discrete coefficient that can be selected from one or more level using neural network as classifier. The proposed speaker recognition system consists of three phases: preprocessing phase (two processes are performed on the sound, DC level removal and resize of sample for 2000 sample for each sound), feature extraction phase (features that distinguish each sound from another, in this research discrete wavelet transform is used for extracting features from sounds), and recognition phase (many classifiers could be used for speaker recognition, in this research supervised neural networks, Back propagation and LVQ are used as classifiers. This research is concerned with studying the feature extracted from discrete wavelet transformation using feed-forward Back propagation and learning vector quantization as classifiers, the neural networks trained with feature extracted from different levels of discrete wavelet transform (one level at a time).

The research illustrates the effected of using different percent (50 %, 40 %, 30 %, 20 %, 10 %) of each level instead of all feature, then compare their recognition ability and decide the best level in addition to the percent of features that are enough to give comparable results with respect to all feature set and decide the neural network architecture that gives the best recognition rate.

The data set consists of different sounds recorded from ten different persons (three female and seven males) using Microsoft sound recorder, each one pronounce five different statements.

In total, the data set consists of 50 samples (15 female and 35 male).

It was found that 50 % is enough for the recognition process, and level 3, 5 are the best levels to extract features at which the recognition rate reaches 96 % with Back propagation NN.

The system was trained and tested using cross-validation.

التخصصات الرئيسية

الهندسة الكهربائية

الموضوعات

الخوارزميات

عدد الصفحات

قائمة المحتويات

Table of contents.

Abstract.

Chapter One : introduction.

Chapter Two : literature survey.

Chapter Three : theoretical background.

Chapter Four : developed system methodology.

Chapter Five : assessment result.

Chapter Six : conclusion and future work.

References.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Abu al-Adas, Firas Id Dayf Allah. (2010). Speaker recognition using discrete wavelet transformation. (Master's theses Theses and Dissertations Master). Al albayt University, Jordan
https://search.emarefa.net/detail/BIM-310853

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Abu al-Adas, Firas Id Dayf Allah. Speaker recognition using discrete wavelet transformation. (Master's theses Theses and Dissertations Master). Al albayt University. (2010).
https://search.emarefa.net/detail/BIM-310853

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

لغة النص

الإنجليزية

نوع البيانات

رسائل جامعية

رقم السجل

BIM-310853

حفظتم الحفظ طباعة

قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"

أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي

مرصد "معرفة"
لقياس الإنتاج العلمي العربي

تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر