المؤلف

al-Duwayri, Rehab

المصدر

The International Arab Journal of Information Technology

العدد

المجلد 4، العدد 2 (30 إبريل/نيسان 2007)، ص ص. 125-131، 7ص.

الناشر

جامعة الزرقاء

تاريخ النشر

2007-04-30

دولة النشر

الأردن

عدد الصفحات

7

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

الملخص EN

In this paper, we compare the performance of three classifiers for Arabic text categorization.

In particular, the naïve Bayes, k-nearest-neighbors (knn), and distance-based classifiers were used.

Unclassified documents were preprocessed by removing punctuation marks and stop words.

Each document is then represented as a vector of words (or of words and their frequencies as in the case of the naïve Bayes classifier).

Stemming was used to reduce the dimensionality of feature vectors of documents.

The accuracy of the classifiers is compared using recall, precision, error rate and fallout.

The results of the experimentations that were carried out on an in-house collected Arabic text show that the naïve Bayes classifier outperforms the other two.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

al-Duwayri, Rehab. 2007. Arabic text categorization. The International Arab Journal of Information Technology،Vol. 4, no. 2, pp.125-131.
https://search.emarefa.net/detail/BIM-11633

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

al-Duwayri, Rehab. Arabic text categorization. The International Arab Journal of Information Technology Vol. 4, no. 2 (Apr. 2007), pp.125-131.
https://search.emarefa.net/detail/BIM-11633

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

al-Duwayri, Rehab. Arabic text categorization. The International Arab Journal of Information Technology. 2007. Vol. 4, no. 2, pp.125-131.
https://search.emarefa.net/detail/BIM-11633

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 129-130

رقم السجل

BIM-11633