Arabic cyberbullying detection using Arabic sentiment analysis
Other Title(s)
كشف التنمر الإلكتروني باللغة العربية باستخدام تحليل المشاعر العربية
Joint Authors
al-Mutayri, Samar
Ahmad, Muhammad Abd al-Fattah Ibrahim
Source
The Egyptian Journal of Language Engineering
Issue
Vol. 8, Issue 1 (30 Apr. 2021), pp.39-50, 12 p.
Publisher
Egyptian Society of Language Engineering
Publication Date
2021-04-30
Country of Publication
Egypt
No. of Pages
12
Main Subjects
Information Technology and Computer Science
Topics
Abstract EN
The Sentiment Analysis is used for the text analysing, and classification of the text attitude.
We are using the computing advancement in the form of Machine Learning (ML) and Support Vector Machine (SVM) algorithm to train a dataset which is collected automatically through ArabiTools and Twitter API.
The dataset contents are labelled by both means, automatic and manual, in order to maintain the efficiency of the detection of CyberBullying tweets.
The dataset is automatically labelled with respect to the nature of the tweet.
If a tweet contains one or more CyberBullying words, it is labelled as CyberBullying, while if there isn't any word with aggressive meaning found, it is marked as the NonCyberBullying.
After the data collection, there are several pre-processing techniques utilized, including the Normalization, Tokenization, Light Stemmer, ArabicStemmerKhoja, and Term Frequency-Inverse Document Frequency (TF-IDF)” term weighting schema.
” After the preliminary steps, (SVM), a “supervised algorithm, ” is used with WEKA and Python.
There are three experiments that take place one with the WEKA tool using the Light Stemmer, the other is again with WEKA using ArabicStemmerKhoja, and the final experiment was performed with Python.
The results are showing the WEKA is more efficient in classifying the text correctly, while Python is more effective with time to build the model.
WEKA using the Light Stemmer have the efficiency of 85.49% and taken 352.51 seconds, and the WEKA using ArabicStemmerKhoja have the efficiency of 85.38% and taken 212.12 seconds, while the Python have the efficiency of 84.03% and taken 142.68 seconds
American Psychological Association (APA)
al-Mutayri, Samar& Ahmad, Muhammad Abd al-Fattah Ibrahim. 2021. Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering،Vol. 8, no. 1, pp.39-50.
https://search.emarefa.net/detail/BIM-1254894
Modern Language Association (MLA)
al-Mutayri, Samar& Ahmad, Muhammad Abd al-Fattah Ibrahim. Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering Vol. 8, no. 1 (Apr. 2021), pp.39-50.
https://search.emarefa.net/detail/BIM-1254894
American Medical Association (AMA)
al-Mutayri, Samar& Ahmad, Muhammad Abd al-Fattah Ibrahim. Arabic cyberbullying detection using Arabic sentiment analysis. The Egyptian Journal of Language Engineering. 2021. Vol. 8, no. 1, pp.39-50.
https://search.emarefa.net/detail/BIM-1254894
Data Type
Journal Articles
Language
English
Notes
-
Record ID
BIM-1254894