Arabic cyberbullying detection using Arabic sentiment analysis

Other Title(s)

كشف التنمر الإلكتروني باللغة العربية باستخدام تحليل المشاعر العربية

Time cited in Arcif : 
1

Joint Authors

al-Mutayri, Samar
Ahmad, Muhammad Abd al-Fattah Ibrahim

Source

The Egyptian Journal of Language Engineering

Issue

Vol. 8, Issue 1 (30 Apr. 2021), pp.39-50, 12 p.

Publisher

Egyptian Society of Language Engineering

Publication Date

2021-04-30

Country of Publication

Egypt

No. of Pages

12

Main Subjects

Information Technology and Computer Science

Topics

Abstract EN

The Sentiment Analysis is used for the text analysing, and classification of the text attitude.

We are using the computing advancement in the form of Machine Learning (ML) and Support Vector Machine (SVM) algorithm to train a dataset which is collected automatically through ArabiTools and Twitter API.

The dataset contents are labelled by both means, automatic and manual, in order to maintain the efficiency of the detection of CyberBullying tweets.

The dataset is automatically labelled with respect to the nature of the tweet.

If a tweet contains one or more CyberBullying words, it is labelled as CyberBullying, while if there isn't any word with aggressive meaning found, it is marked as the NonCyberBullying.

After the data collection, there are several pre-processing techniques utilized, including the Normalization, Tokenization, Light Stemmer, ArabicStemmerKhoja, and Term Frequency-Inverse Document Frequency (TF-IDF)” term weighting schema.

” After the preliminary steps, (SVM), a “supervised algorithm, ” is used with WEKA and Python.

There are three experiments that take place one with the WEKA tool using the Light Stemmer, the other is again with WEKA using ArabicStemmerKhoja, and the final experiment was performed with Python.

The results are showing the WEKA is more efficient in classifying the text correctly, while Python is more effective with time to build the model.

WEKA using the Light Stemmer have the efficiency of 85.49% and taken 352.51 seconds, and the WEKA using ArabicStemmerKhoja have the efficiency of 85.38% and taken 212.12 seconds, while the Python have the efficiency of 84.03% and taken 142.68 seconds

American Psychological Association (APA)

al-Mutayri, Samar& Ahmad, Muhammad Abd al-Fattah Ibrahim. 2021. Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering،Vol. 8, no. 1, pp.39-50.
https://search.emarefa.net/detail/BIM-1254894

Modern Language Association (MLA)

al-Mutayri, Samar& Ahmad, Muhammad Abd al-Fattah Ibrahim. Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering Vol. 8, no. 1 (Apr. 2021), pp.39-50.
https://search.emarefa.net/detail/BIM-1254894

American Medical Association (AMA)

al-Mutayri, Samar& Ahmad, Muhammad Abd al-Fattah Ibrahim. Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering. 2021. Vol. 8, no. 1, pp.39-50.
https://search.emarefa.net/detail/BIM-1254894

Data Type

Journal Articles

Language

English

Notes

-

Record ID

BIM-1254894