A feature selection method based on binary black hole algorithm for spam E-mail filtering

مقدم أطروحة جامعية

Abd Allah, Amal Mahmud Abd Allah

مشرف أطروحة جامعية

al-Hilali, Adnan Hadi Mahdi

الجامعة

جامعة الإسراء

الكلية

كلية تكنولوجيا المعلومات

القسم الأكاديمي

قسم هندسة البرمجيات

دولة الجامعة

الأردن

الدرجة العلمية

ماجستير

تاريخ الدرجة العلمية

2019

الملخص الإنجليزي

For the past thirty-years, the using of the electronic mails (E-mails) played an important role for online communication all around the world.

In the meantime, e-mails messages are the main transaction method between millions of users.

Therefore, such an important transaction method has attracted the hackers to attack the users by faking the emails called “Spam”, which may contain danger files.

Feature selection algorithms studied according to the type of learning: supervised or unsupervised, this work is the supervised feature selection.

Filtering these emails is a classification problem, which is difficult task and a hot research area.

The classification performance of the machine learning models based on collected datasets still needs to be enhanced.

The most popular dataset is SPAMBASE, which consists of 57 features, some of these features are not important and need to be removed.

Selecting the important subset of features, enhances the classification performance and reduces the required time for training process.

The purpose of this thesis is presenting a machine learning approach for enhancing the accuracy of automatic spam detecting and filtering and separating them from legitimate messages.

The proposed algorithm is a hybrid filter and wrapper feature selection algorithm which has the ability to select the most relevant features.

The first part of the proposed algorithm is the information gain method, which represents the filter feature selection part.

While the wrapper method is represented by Black Hole (BH) algorithm.

BH algorithm is a recently developed algorithm for solving optimization problems, which mimics the universal phenomenon of black holes.

The proposed algorithm handles the features in a binary form where 1 represents the selected features, while 0 represents the unselected features.

The fitness for each star or solution was evaluated using naïve bayesian classifier (nbc), which indicates the black hole (i.e., best solution).

The algorithm has been experimented by using different scenarios, the binary hybrid filer-wrapper algorithm (BBH) enhances the accuracy of the email spam filtering system.

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

عدد الصفحات

65

قائمة المحتويات

Table of contents.

Abstract.

Chapter One : Introduction.

Chapter Two : Theoretical background.

Chapter Three : The proposed algorithm.

Chapter Four : Results and disscusion.

Chapter Five : Conclusion and future works.

References.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Abd Allah, Amal Mahmud Abd Allah. (2019). A feature selection method based on binary black hole algorithm for spam E-mail filtering. (Master's theses Theses and Dissertations Master). Isra University, Jordan
https://search.emarefa.net/detail/BIM-988680

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Abd Allah, Amal Mahmud Abd Allah. A feature selection method based on binary black hole algorithm for spam E-mail filtering. (Master's theses Theses and Dissertations Master). Isra University. (2019).
https://search.emarefa.net/detail/BIM-988680

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Abd Allah, Amal Mahmud Abd Allah. (2019). A feature selection method based on binary black hole algorithm for spam E-mail filtering. (Master's theses Theses and Dissertations Master). Isra University, Jordan
https://search.emarefa.net/detail/BIM-988680

لغة النص

الإنجليزية

نوع البيانات

رسائل جامعية

رقم السجل

BIM-988680