English text classification using improved recursive feature elimination (IRFE)‎ algorithm

العناوين الأخرى

تصنيف النص الإنجليزي باستخدام الخوارزمية العودية المحسنة لإزالة الخواص (IRFE)‎

المؤلفون المشاركون

Ulaywi, Ahmad Husayn
Abd al-Amir, Isra Husayn

المصدر

Journal of Engineering Sciences and Information Technology

العدد

المجلد 4، العدد 2 (30 يونيو/حزيران 2020)، ص ص. 110-120، 11ص.

الناشر

المركز القومي للبحوث

تاريخ النشر

2020-06-30

دولة النشر

فلسطين (قطاع غزة)

عدد الصفحات

11

التخصصات الرئيسية

الرياضيات

الملخص EN

Documents classification is from most important fields for Natural language processing and text mining.

There are many algorithms can be used for this task.

In this paper, focuses on improving Text Classification by feature selection.

This means determine some of the original features without affecting the accuracy of the work, where our work is a new feature selection method was suggested which can be a general formulation and mathematical model of Recursive Feature Elimination (RFE).

The used method was compared with other two well-known feature selection methods: Chi-square and threshold.

The results proved that the new method is comparable with the other methods, The best results were 83% when 60% of features used, 82% when 40% of features used, and 82% when 20% of features used.

The tests were done with the Naïve Bayes (NB) and decision tree (DT) classification algorithms , where the used dataset is a well-known English data set “20 newsgroups text” consists of approximately 18846 files.

The results showed that our suggested feature selection method is comparable with standard Like Chi-square.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Abd al-Amir, Isra Husayn& Ulaywi, Ahmad Husayn. 2020. English text classification using improved recursive feature elimination (IRFE) algorithm. Journal of Engineering Sciences and Information Technology،Vol. 4, no. 2, pp.110-120.
https://search.emarefa.net/detail/BIM-1279258

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Abd al-Amir, Isra Husayn& Ulaywi, Ahmad Husayn. English text classification using improved recursive feature elimination (IRFE) algorithm. Journal of Engineering Sciences and Information Technology Vol. 4, no. 2 (Jun. 2020), pp.110-120.
https://search.emarefa.net/detail/BIM-1279258

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Abd al-Amir, Isra Husayn& Ulaywi, Ahmad Husayn. English text classification using improved recursive feature elimination (IRFE) algorithm. Journal of Engineering Sciences and Information Technology. 2020. Vol. 4, no. 2, pp.110-120.
https://search.emarefa.net/detail/BIM-1279258

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 118-120

رقم السجل

BIM-1279258