Support vector machine text classifier for Arabic articles : ant colony optimization based feature subset selection

Dissertant

Muslih, Abd al-wadud Muhammad Abd al-Wadud

Thesis advisor

Kanan, Ghassan Jaddu

Comitee Members

al-Shalabi, Riyad
Abu Shawir, Bayan
al-Arif, Taha

University

Arab Academy for Financial and Banking Sciences

Faculty

The Faculty of Information Systems and Technology

Department

Computer information systems

University Country

Jordan

Degree

Ph.D.

Degree Date

2008

English Abstract

In this thesis, we have implemented a support vector machine (SVM) text classifier for Arabic articles.

Experimental results show that the SVM classifier outperformed Naive Bayesian (NB) and k-nearest neighbor (kNN) classifiers.

We investigated the effectiveness of six state-of-the-art feature subset selection (FSS) methods, which are commonly used in text classification (TC) tasks, for our Arabic SVM text classification system.

We implemented an Ant Colony Optimization Based-Feature Subset Selection (ACO Based-FSS) method for our Arabic SVM text classifier.

The proposed FSS method adapted Chi-square statistic as heuristic information and the effectiveness of the SVM classifier as a guide to improving the selection of features for each category.

Compared to the six state-of-the-art FSS methods, our ACO Based-FSS algorithm achieved better TC effectiveness.

Evaluation used an in-house Arabic TC corpus1 that consists of 1445 documents independently classified into nine categories.

The experimental results were presented in terms of macro-averaging precision, macro averaging recall and macro-averaging F1 measures.

Main Subjects

Information Technology and Computer Science

Topics

No. of Pages

114

Table of Contents

Table of contents.

Abstract.

Chapter one : Introduction.

Chapter two : Related work.

Chapter three : Text classification.

Chapter four : Feature subset selection.

Chapter five : Introduction to Arabic language.

Chapter six : Ant colony optimization based-feature subset selection algorithm for text classification tasks.

Chapter seven : Experimental results.

Chapter eight : Conclusions and future directions.

References.

American Psychological Association (APA)

Muslih, Abd al-wadud Muhammad Abd al-Wadud. (2008). Support vector machine text classifier for Arabic articles : ant colony optimization based feature subset selection. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences, Jordan
https://search.emarefa.net/detail/BIM-306127

Modern Language Association (MLA)

Muslih, Abd al-wadud Muhammad Abd al-Wadud. Support vector machine text classifier for Arabic articles : ant colony optimization based feature subset selection. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences. (2008).
https://search.emarefa.net/detail/BIM-306127

American Medical Association (AMA)

Muslih, Abd al-wadud Muhammad Abd al-Wadud. (2008). Support vector machine text classifier for Arabic articles : ant colony optimization based feature subset selection. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences, Jordan
https://search.emarefa.net/detail/BIM-306127

Language

English

Data Type

Arab Theses

Record ID

BIM-306127