Filtering spam e-mail from mixed Arabic and English messages : A comparison of machine learning techniques

Author

al-Halis, Ala Mustafa Darwish

Source

The International Arab Journal of Information Technology

Issue

Vol. 6, no. 1 (31 January 2009), pp. 52-59, 8 p.

Publisher

Zarqa University

Publication date

2009-01-31

Country of publication

Jordan

Number of pages

8

Main subjects

Information Technology and Computer Science

Topics

Abstract (EN)

Spam is one of the main problems in email communication.

Although the volume of non-English spam is increasing, little work has been done in this area.

For example, users in the Arab world receive spam written mostly in Arabic, English, or a mix of Arabic and English.

To filter such messages, this research applied several machine learning techniques.

Many researchers have used machine learning techniques to filter spam email messages.

This study compared six supervised machine learning classifiers: maximum entropy, decision trees, artificial neural networks, naïve Bayes, support vector machines, and k-nearest neighbor.

The experiments suggested that words in Arabic messages should be stemmed before a classifier is applied.

In addition, the experiments showed that, in most cases, classifiers that use feature selection techniques can achieve comparable or better performance than filters that do not use them.
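
The pipeline the abstract describes (stem Arabic words, select features, then classify) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the article's method: the affix lists in `light_stem`, the use of document frequency as the feature selection criterion, the choice of naïve Bayes from the six classifiers, and the four toy messages.

```python
import math
from collections import Counter

# Hypothetical affix lists for a very light Arabic stemmer (illustrative only).
AR_PREFIXES = ("وال", "بال", "ال", "لل")
AR_SUFFIXES = ("ات", "ون", "ين", "ها", "ة")

def light_stem(token):
    """Lowercase non-Arabic tokens; strip at most one prefix and one suffix from Arabic ones."""
    if not any("\u0600" <= ch <= "\u06FF" for ch in token):
        return token.lower()
    for p in AR_PREFIXES:
        if token.startswith(p) and len(token) - len(p) >= 3:
            token = token[len(p):]
            break
    for s in AR_SUFFIXES:
        if token.endswith(s) and len(token) - len(s) >= 3:
            token = token[:-len(s)]
            break
    return token

def tokenize(text):
    return [light_stem(t) for t in text.split()]

def select_features(docs, k):
    """Keep the k tokens with the highest document frequency (a simple stand-in criterion)."""
    df = Counter(tok for doc in docs for tok in set(doc))
    return {tok for tok, _ in df.most_common(k)}

def train_nb(docs, labels, vocab):
    """Collect per-class token counts over the selected vocabulary."""
    counts = {c: Counter() for c in set(labels)}
    for doc, c in zip(docs, labels):
        counts[c].update(t for t in doc if t in vocab)
    return counts, Counter(labels), vocab

def predict_nb(model, doc):
    """Multinomial naive Bayes with add-one smoothing; tokens outside the vocabulary are ignored."""
    counts, priors, vocab = model
    total = sum(priors.values())
    scores = {}
    for c in counts:
        n = sum(counts[c].values())
        scores[c] = math.log(priors[c] / total) + sum(
            math.log((counts[c][t] + 1) / (n + len(vocab))) for t in doc if t in vocab
        )
    return max(scores, key=scores.get)

# Toy mixed Arabic/English messages, invented for this sketch.
messages = [
    ("free offer اشتر الآن", "spam"),
    ("عرض مجاني free prize", "spam"),
    ("meeting tomorrow الاجتماع غدا", "ham"),
    ("تقرير الاجتماع attached", "ham"),
]
docs = [tokenize(text) for text, _ in messages]
labels = [label for _, label in messages]
vocab = select_features(docs, k=5)
model = train_nb(docs, labels, vocab)
print(predict_nb(model, tokenize("Free free")))
```

Stemming lets `الاجتماع` and `اجتماع` count as the same feature, which is the kind of effect the abstract's stemming recommendation points to. A real experiment would use a proper Arabic stemmer and a labeled corpus.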

American Psychological Association (APA) citation style

al-Halis, Ala Mustafa Darwish. 2009. Filtering spam e-mail from mixed Arabic and English messages : A comparison of machine learning techniques. The International Arab Journal of Information Technology, Vol. 6, no. 1, pp. 52-59.
https://search.emarefa.net/detail/BIM-10472

Modern Language Association (MLA) citation style

al-Halis, Ala Mustafa Darwish. Filtering spam e-mail from mixed Arabic and English messages : A comparison of machine learning techniques. The International Arab Journal of Information Technology Vol. 6, no. 1 (Jan. 2009), pp. 52-59.
https://search.emarefa.net/detail/BIM-10472

American Medical Association (AMA) citation style

al-Halis, Ala Mustafa Darwish. Filtering spam e-mail from mixed Arabic and English messages : A comparison of machine learning techniques. The International Arab Journal of Information Technology. 2009. Vol. 6, no. 1, pp. 52-59.
https://search.emarefa.net/detail/BIM-10472

Data type

Articles

Text language

English

Notes

Includes bibliographical references : p. 58-59

Record number

BIM-10472