A feature selection method based on binary black hole algorithm for spam E-mail filtering

Dissertant

Abd Allah, Amal Mahmud Abd Allah

Thesis advisor

al-Hilali, Adnan Hadi Mahdi

University

Isra University

Faculty

Faculty of Information Technology

Department

Department of Software Engineering

University Country

Jordan

Degree

Master

Degree Date

2019

English Abstract

For the past thirty years, electronic mail (e-mail) has played an important role in online communication around the world.

Today, e-mail messages are the main means of exchange among millions of users.

Such an important channel has therefore attracted attackers, who target users with fake e-mails known as “spam”, which may contain dangerous files.

Feature selection algorithms are categorized by the type of learning, supervised or unsupervised; this work addresses supervised feature selection.

Filtering these e-mails is a classification problem, which is a difficult task and an active research area.

The classification performance of the machine learning models based on collected datasets still needs to be enhanced.

The most popular dataset is SPAMBASE, which consists of 57 features; some of these features are not important and need to be removed.

Selecting the important subset of features enhances the classification performance and reduces the time required for training.

The purpose of this thesis is to present a machine learning approach for enhancing the accuracy of automatic spam detection and filtering, separating spam from legitimate messages.

The proposed algorithm is a hybrid filter-wrapper feature selection algorithm that can select the most relevant features.

The first part of the proposed algorithm is the information gain method, which represents the filter feature selection part.
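
As a rough illustration of the filter step, information gain can be computed as the entropy of the class labels minus the weighted entropy that remains after splitting on a feature. The sketch below assumes features that have already been discretized (SPAMBASE attributes are continuous, so some thresholding would be needed first); the function names and the toy data are hypothetical and are not taken from the thesis.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """Label entropy minus the weighted entropy after splitting on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Toy data, made up for illustration: 1 = spam, 0 = legitimate.
labels = [1, 1, 0, 0]
feature_a = [1, 1, 0, 0]  # separates the two classes perfectly
feature_b = [1, 0, 1, 0]  # carries no information about the class

gain_a = information_gain(feature_a, labels)
gain_b = information_gain(feature_b, labels)
```

A filter of this kind would rank all features by their gain and pass only the higher-ranked ones on to the wrapper stage; the cut-off used in the thesis is not stated in this record.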

The wrapper part is represented by the Black Hole (BH) algorithm.

The BH algorithm is a recently developed optimization algorithm that mimics the black hole phenomenon in the universe.
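
For readers unfamiliar with the metaphor, the following is a minimal sketch of the continuous black hole heuristic as it is commonly described: the best candidate is the black hole, the other stars drift toward it, and any star that comes within the event horizon radius is replaced by a new random star. It minimizes a simple test function rather than selecting features, and the function name, parameters, and default values are assumptions made for illustration, not the thesis's implementation.

```python
import math
import random


def black_hole_minimize(objective, dim, n_stars=20, n_iter=100,
                        low=-5.0, high=5.0, seed=0):
    """Minimize a non-negative objective with a basic black hole heuristic."""
    rng = random.Random(seed)

    def new_star():
        return [rng.uniform(low, high) for _ in range(dim)]

    stars = [new_star() for _ in range(n_stars)]
    fitness = [objective(s) for s in stars]
    bh = min(range(n_stars), key=fitness.__getitem__)  # index of the black hole
    history = [fitness[bh]]

    for _ in range(n_iter):
        # Each star moves a random fraction of the way toward the black hole.
        for i in range(n_stars):
            if i == bh:
                continue
            stars[i] = [x + rng.random() * (b - x)
                        for x, b in zip(stars[i], stars[bh])]
            fitness[i] = objective(stars[i])
            if fitness[i] < fitness[bh]:
                bh = i  # a better star becomes the new black hole

        # Event horizon: black hole fitness divided by the sum of all fitness values.
        total = sum(fitness)
        radius = fitness[bh] / total if total > 0 else 0.0
        for i in range(n_stars):
            if i != bh and math.dist(stars[i], stars[bh]) < radius:
                stars[i] = new_star()  # swallowed star is replaced by a random one
                fitness[i] = objective(stars[i])
                if fitness[i] < fitness[bh]:
                    bh = i
        history.append(fitness[bh])

    return stars[bh], fitness[bh], history


def sphere(x):
    """Simple test objective with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best, best_value, history = black_hole_minimize(sphere, dim=3)
```

Because the black hole itself never moves and is only replaced by a strictly better star, the best value recorded in `history` cannot get worse from one iteration to the next.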

The proposed algorithm represents the features in binary form, where 1 marks a selected feature and 0 marks an unselected feature.

The fitness of each star (i.e., candidate solution) was evaluated using a naïve Bayesian classifier (NBC); the star with the best fitness is designated the black hole (i.e., the best solution).
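
One way to picture this evaluation is sketched below: a binary mask keeps only the selected columns, a small naive Bayes model is trained on them, and the accuracy on held-out messages serves as the fitness. The abstract does not say which naive Bayes variant or validation scheme was used, so the Bernoulli model on 0/1 features, the single train/test split, the function names, and the toy data are all assumptions.

```python
import math


def select_columns(rows, mask):
    """Keep only the columns whose mask bit is 1."""
    return [[x for x, bit in zip(row, mask) if bit] for row in rows]


def train_bernoulli_nb(rows, labels):
    """Fit Bernoulli naive Bayes with Laplace smoothing on 0/1 features."""
    model = {}
    n_features = len(rows[0])
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        prior = len(members) / len(rows)
        # P(feature = 1 | class), smoothed so it is never exactly 0 or 1.
        probs = [(sum(r[j] for r in members) + 1) / (len(members) + 2)
                 for j in range(n_features)]
        model[cls] = (prior, probs)
    return model


def predict(model, row):
    """Return the class with the highest log posterior for one row."""
    def log_posterior(cls):
        prior, probs = model[cls]
        return math.log(prior) + sum(
            math.log(p if x else 1 - p) for x, p in zip(row, probs))
    return max(model, key=log_posterior)


def fitness(mask, train_rows, train_labels, test_rows, test_labels):
    """Held-out accuracy of naive Bayes using only the features selected by mask."""
    if not any(mask):
        return 0.0  # an empty feature subset cannot classify anything
    model = train_bernoulli_nb(select_columns(train_rows, mask), train_labels)
    selected = select_columns(test_rows, mask)
    correct = sum(predict(model, r) == y for r, y in zip(selected, test_labels))
    return correct / len(test_labels)


# Toy data, made up for illustration: column 0 tracks the label, column 1 is noise.
train_rows = [[1, 0], [1, 1], [0, 0], [0, 1]]
train_labels = [1, 1, 0, 0]
test_rows = [[1, 1], [0, 0], [1, 0], [0, 1]]
test_labels = [1, 0, 1, 0]

informative_only = fitness([1, 0], train_rows, train_labels, test_rows, test_labels)
noise_only = fitness([0, 1], train_rows, train_labels, test_rows, test_labels)
```

In a wrapper search, each star's mask would be scored this way, and the mask with the highest score would take the role of the black hole.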

The algorithm was tested under different scenarios; the binary hybrid filter-wrapper algorithm (BBH) enhances the accuracy of the e-mail spam filtering system.

Main Subjects

Information Technology and Computer Science

No. of Pages

65

Table of Contents

Table of contents.

Abstract.

Chapter One : Introduction.

Chapter Two : Theoretical background.

Chapter Three : The proposed algorithm.

Chapter Four : Results and discussion.

Chapter Five : Conclusion and future works.

References.

American Psychological Association (APA)

Abd Allah, Amal Mahmud Abd Allah. (2019). A feature selection method based on binary black hole algorithm for spam E-mail filtering. (Master's thesis). Isra University, Jordan
https://search.emarefa.net/detail/BIM-988680

Modern Language Association (MLA)

Abd Allah, Amal Mahmud Abd Allah. A feature selection method based on binary black hole algorithm for spam E-mail filtering. (Master's thesis). Isra University. (2019).
https://search.emarefa.net/detail/BIM-988680

American Medical Association (AMA)

Abd Allah, Amal Mahmud Abd Allah. (2019). A feature selection method based on binary black hole algorithm for spam E-mail filtering. (Master's thesis). Isra University, Jordan
https://search.emarefa.net/detail/BIM-988680

Language

English

Data Type

Arab Theses

Record ID

BIM-988680