Improving the performance of Naïve Bayes algorithm for Arabic text categorization

Other Title(s)

تحسين أداء خوارزمية (Naïve Bayes)‎ في تصنيف النصوص العربية

Dissertant

Abu Aqulah, Nibras Jamal

Thesis advisor

Qaqish, Malik
al-Mashayikhi, Akram Uthman

Comitee Members

Azami, Muayyad Abd al-Razzaq
al-Mashaqibah, Firas Faris

University

Amman Arab University

Faculty

Collage of Computer Sciences and Informatics

Department

Department of Computer Science

University Country

Jordan

Degree

Master

Degree Date

2010

English Abstract

In this research four techniques are implemented to find the best results using Naïve Bayes algorithm classifier for Arabic text categorization.

These techniques are:(Term Frequency (TF),Term Frequency –Inverse document frequency TF-IDF, Normalized TF-IDF, Normalized TF-IDF with N-Gram (N=2) statistical stemmer and threshold similarity 0.8). The four techniques are evaluated by two test sets.

The results showed that the Normalized TF-IDF with N-Gram with N=2 statistical stemmer with threshold similarity 0.8 technique had the best accuracy result, and the TF technique had the shortest running time

Main Subjects

Mathematics

Topics

No. of Pages

45

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Classifier algorithms.

Chapter Three : Literature review.

Chapter Four : The proposed system architecture.

Chapter Five : Experiments and evaluation.

Chapter Six : Conclusion and future work.

References.

American Psychological Association (APA)

Abu Aqulah, Nibras Jamal. (2010). Improving the performance of Naïve Bayes algorithm for Arabic text categorization. (Master's theses Theses and Dissertations Master). Amman Arab University, Jordan
https://search.emarefa.net/detail/BIM-529116

Modern Language Association (MLA)

Abu Aqulah, Nibras Jamal. Improving the performance of Naïve Bayes algorithm for Arabic text categorization. (Master's theses Theses and Dissertations Master). Amman Arab University. (2010).
https://search.emarefa.net/detail/BIM-529116

American Medical Association (AMA)

Abu Aqulah, Nibras Jamal. (2010). Improving the performance of Naïve Bayes algorithm for Arabic text categorization. (Master's theses Theses and Dissertations Master). Amman Arab University, Jordan
https://search.emarefa.net/detail/BIM-529116

Language

English

Data Type

Arab Theses

Record ID

BIM-529116