USAD: An Intelligent System for Slang and Abusive Text Detection in PERSO-Arabic-Scripted Urdu

Co-authors

Haq, Nauman Ul
Ullah, Mohib
Khan, Rafiullah
Ahmad, Arshad
Almogren, Ahmad
Hayat, Bashir
Shafi, Bushra

Source

Complexity

Issue

Vol. 2020, no. 2020 (31 December 2020), pp. 1-7, 7 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-11-30

Country of Publication

Egypt

Number of Pages

7

Main Subjects

Philosophy

Abstract EN

The use of slang, abusive, and offensive language has become common practice on social media.

Although social media companies have censorship policies covering slang, abusive, vulgar, and offensive language, such language persists because resources and research on automatic abusive language detection for languages other than English remain limited.

This study proposes USAD (Urdu Slang and Abusive words Detection), a lexicon-based intelligent framework to detect abusive and slang words in Perso-Arabic-scripted Urdu Tweets.

Furthermore, because no standard dataset is available, we also design and annotate a dataset of abusive, offensive, and slang words in Perso-Arabic-scripted Urdu as our second significant contribution, intended to support future research.

The results show that the proposed USAD model correctly classifies 72.6% of Tweets as abusive or nonabusive.

Additionally, we identify some key factors that can help researchers improve their abusive language detection models.

American Psychological Association (APA) citation style

Haq, Nauman Ul & Ullah, Mohib & Khan, Rafiullah & Ahmad, Arshad & Almogren, Ahmad & Hayat, Bashir…[et al.]. 2020. USAD: An Intelligent System for Slang and Abusive Text Detection in PERSO-Arabic-Scripted Urdu. Complexity, Vol. 2020, no. 2020, pp. 1-7.
https://search.emarefa.net/detail/BIM-1143232

Modern Language Association (MLA) citation style

Haq, Nauman Ul…[et al.]. USAD: An Intelligent System for Slang and Abusive Text Detection in PERSO-Arabic-Scripted Urdu. Complexity, no. 2020 (2020), pp. 1-7.
https://search.emarefa.net/detail/BIM-1143232

American Medical Association (AMA) citation style

Haq, Nauman Ul & Ullah, Mohib & Khan, Rafiullah & Ahmad, Arshad & Almogren, Ahmad & Hayat, Bashir…[et al.]. USAD: An Intelligent System for Slang and Abusive Text Detection in PERSO-Arabic-Scripted Urdu. Complexity. 2020. Vol. 2020, no. 2020, pp. 1-7.
https://search.emarefa.net/detail/BIM-1143232

Data Type

Articles

Text Language

English

Notes

Includes bibliographical references

Record ID

BIM-1143232