Jordan birds recognition system using deep learning

Dissertant

al-Qabilat, Suhayb Turki Abd al-Rahman

Thesis advisor

al-Shurah, Sulayman Ayid Khalaf

University

Mutah University

Faculty

Information Technology College

Department

Computer Science Department

University Country

Jordan

Degree

Master

Degree Date

2020

Arabic Abstract

دور مراقبة الطيور هو الحفاظ على الطبيعة من خلال مراقبة سلوك الطيور ونمط الهجرة، و مع ذلك، فإن التحدي الذي يواجه مراقبي الطيور يمكن أن يتمثل في تحديد الطيور، و التي تبقى صعبة بسبب تشابه أشكال الطيور و قلة الخبرة في هذا المجال.

و من ثم، فإن مجال مراقبة الطيور يحتاج إلى أنظمة كمبيوتر تساعد مراقبي الطيور على التعرف على الطيور.

يحتاج التعرف على الطيور إلى عمل شاق في البحث، حيث أن محتوى صور الطيور محاط بتأثيرات أخرى تجعل مهمة التعرف ليست سهلة بالإضافة إلى ذلك، فإن الاختلافات البسيطة بين أنواع الطيور تجعل من الصعب التمييز بين الطيور.

لذلك، من أجل تحقيق مهمة الاعتراف، فإنها تتطلب ميزات فريدة.

تهدف هذه الدراسة إلى التحقق من استخدام التعلم العميق لنظام التعرف على الطيور في الأردن باستخدام 19-VGG لاستخراج الميزات من الصور 1998-VG هي إحدى الشبكات العصبية التلاقيفية (CNN) المدربة مسبقاً و التي تستخدم للتعرف على الصور و ايضا تم استخدامها في هذه الأطروحة لاستخراج الميزات من صور الطيور.

CNN هي فلة من الشبكات العصرية العميقة، و هي واحدة من أكثر الشبكات العصرية شيوعا في مجال التعرف على الصور ميزات CNN المستخرجة للصور تلقائية، حيث إنها لا تعمل مثل الإجراءات السابقة في الميزات المستخرجة يدويا.

تم تدريب 190-VGG على أكثر من مليون صورة من قاعدة بيانات ImageNet .

من أجل التعرف على طهور الأردن، صممنا قاعدة بياناتنا لصور الطيور التي تم جمعها في الأردن.

تحتوي قاعدة بياناتنا على 4340 صورة لـ 434 نوعا من الطيور تم الحصول عليها من مصادر علمية و حيث تم اعتمادها من قبل جمعية مراقبة الطيور الأردنية بناءً على اسمها العلمي.

في هذه الدراسة، الطبقتان في بنية VGG19 اللتان يستخدمان للحصول على الميزات هما الطبقة 6 (تسمى (fc6) والطبقة 7 (تسمى (7) تحتوي كل طبقة على 4096 معلما.

نظرا لأن حجم متجه الميزات العميقة الذي تم الحصول عليه من طبقات VGG19 (6) أو (7) كبير جدا (4096)، تم استخدام تحليل المكونات الرئيسية (PCA) للقيام بتقليل الأبعاد يؤدي هذا إلى إنشاء مجموعتي بيانات إضافيتين بميزات أقل من تلك التي تم الحصول عليها من الطبقة 6 أو الطبقة 7، وتستخدم مجموعات البيانات الأربع ذات التخفيض أو بدونه كمدخلات للتصنيف باستخدام المصنفات المختلفة لتعلم الآلة وهي الشبكات العصبية الاصطناعية (ANN) و KNN و Random Forest و Nave Bayes و Decision أيضا، تم إنشاء المزيد من نواقل المعالم لإنشاء مجموعات بيانات من متجهات السمات العميقة المذكورة أعلاه (تم الحصول عليها من الطبقتين 6 و 7) باستخدام متوسط وأننى وأقصى و دمج كلتا الطبقتين.

تشمل نتائج تحقيقنا، على سبيل المثال لا الحصر، ما يلي، إن PCA المستخدم في الميزات العميقة لا يقال فقط من الأبعاد، بل يقال وقت التدريب / الاختبار بشكل كبير، و يسمح أيضا بزيادة دقة تحديد الهوية، خاصة عند استخدام مصنف .

ANN بناءً على نتائج المصنفات : أظهرت ANN أن accuracy (70.9908) و recalls (0.718)precision و (F-Measure (708) مقارنة بالمصنفات الأخرى

English Abstract

The role of Birdwatching is to preserve the nature through observing birds behavior and migration pattern.

However, the challenge for bird watchers can be represented by identifying birds, which remains difficult due to the similarity of the forms of birds and the lack of experience in this field.

Hence, the field of Birdwatching needs computer systems that will help birdwatchers to identify birds.

Birds recognition needs hard work in the research, as the birds’ images content is surrounding with other effects that makes the recognition task is not easy.

In addition, the simple differences between species of birds make the recognition task is difficult to distinguish between birds.

So, in order to achieve the recognition task, it requires a unique features.

This study aims at investigating the use of deep learning for birds recognition system in Jordan using VGG-19 for extracting features from images.

VGG-19 is one of the pre-trained convolutional neural network (CNN) networks that used for image recognition which was used in this thesis to extract the features from birds images.

The CNN is a category of deep neural networks, which is one of the most common neural networks in the field of images recognition.

CNN's extracted features for image automatically, as it doesn’t work like previous procedures in extracting features manually.

The VGG-19 is trained on more than a million images from the ImageNet database.

For the sake of Jordan bird’s recognition, we designed our database of birds’ images that were collected in Jordan.

Our database is contained 4340 images of 434 bird species obtained from scientific sources and were approved by Jordanian Bird Watching Association based on their scientific names.

In this study, the two layers in the structure of VGG19 that used to get the features are layer 6 (called fc6) and layer 7 (called fc7); each of layer containing 4096 features.

Since the size of the deep feature vector obtained from the VGG19’s layers (6 or 7) is very large (4096), the Principal Component Analysis (PCA) was used to do the dimensionality reduction.

This generates two more datasets with fewer features than those obtained from layer 6 or layer 7.

The four datasets with and without reduction are used as input for classification using various machine learning classifiers, which are Artificial neural networks (ANN), KNN, Random Forest, Naive Bayes and Decision Tree.

Also, more feature vectors were created to generate more datasets from the aforementioned deep feature vectors (obtained from layers 6 and 7) using average, minimum, maximum and combine of both layers.

The results of our investigation include and not limited to the following, the PCA used on the deep features does not only reduce the dimensionality, and therefore, the training/testing time is reduced significantly, but also allows for increasing the identification accuracy, particularly when using the ANN classifier.

Based on the results of classifiers; ANN showed high classification accuracy (70.9908), precision (0.718), recall (0.71) and F-Measure (0.708) compared to other classifiers.

Main Subjects

Information Technology and Computer Science

No. of Pages

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Theoretical background and literature review.

Chapter Three : Design and methodology (Proposed Work).

Chapter Four : Results and discussions.

Chapter Five : Conclusion and future work.

References.

American Psychological Association (APA)

al-Qabilat, Suhayb Turki Abd al-Rahman. (2020). Jordan birds recognition system using deep learning. (Master's theses Theses and Dissertations Master). Mutah University, Jordan
https://search.emarefa.net/detail/BIM-1350781

Modern Language Association (MLA)

al-Qabilat, Suhayb Turki Abd al-Rahman. Jordan birds recognition system using deep learning. (Master's theses Theses and Dissertations Master). Mutah University. (2020).
https://search.emarefa.net/detail/BIM-1350781

American Medical Association (AMA)

Language

English

Data Type

Arab Theses

Record ID

BIM-1350781

SaveSaved Print

Arab Citation & Impact Factor "Arcif"

Largest Arabic Database of Citations Analysis for the Arabic Scholarly Journals Issued in Arab World.

eMarefa Indicators
for Arab Scientific Production

"Kashif" for Checking Similarity or Plagiarism in the Arabic Researches. know more

e-Marefa