Cybercrime and authorship detection in very short texts a quantitative morpho-lexical approach

المؤلف

Umar, Abd al-Fattah

المصدر

Journal of Scientific Research in Arts

العدد

المجلد 2019، العدد 20، ج. 1 (31 يناير/كانون الثاني 2019)، ص ص. 1-25، 25ص.

الناشر

جامعة عين شمس كلية البنات للآداب و العلوم و التربية

تاريخ النشر

2019-01-31

دولة النشر

مصر

عدد الصفحات

25

التخصصات الرئيسية

الآداب

الموضوعات

الملخص EN

The present study proposes an integrated framework that considers letter-pair frequencies/combinations along with the lexical features of documents.

Drawing on a quantitative morpho-lexical approach, the study tests the hypothesis that letter information or mapping carries unique stylistic features; and therefore detecting stable word combinations and morphological patterns can be used to enhance the authorship performance in relation to very short texts.

The data used for analysis is a corpus of 12240 tweets derived from 87 Twitter accounts.

Self-organizing maps (SOMs) model is used for classifying the input patterns that share common features together as a clue that tweets grouped under one class membership are written by the same author.

Results indicate that the classification accuracy based on the proposed system is around 76% .

Up to 22% of this accuracy was lost, however, when only distinctive words were used, and 26% was lost when the classification performance was based on letter combinations and morphological patterns only.

The integration of letter-pairs and morphological patterns had the advantage of improving the accuracy of determining the author of a given tweet.

This indicates that the integration of different linguistic variables into an integrated system leads to a better classification performance of very short texts.

It is also clear that the use of the self-organizing map (SOM) led to better clustering performance for its capacity to integrate two different linguistic levels of each author profile together.

.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Umar, Abd al-Fattah. 2019. Cybercrime and authorship detection in very short texts a quantitative morpho-lexical approach. Journal of Scientific Research in Arts،Vol. 2019, no. 20، ج. 1, pp.1-25.
https://search.emarefa.net/detail/BIM-1087162

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Umar, Abd al-Fattah. Cybercrime and authorship detection in very short texts a quantitative morpho-lexical approach. Journal of Scientific Research in Arts No. 20, p. 1 (2019), pp.1-25.
https://search.emarefa.net/detail/BIM-1087162

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Umar, Abd al-Fattah. Cybercrime and authorship detection in very short texts a quantitative morpho-lexical approach. Journal of Scientific Research in Arts. 2019. Vol. 2019, no. 20، ج. 1, pp.1-25.
https://search.emarefa.net/detail/BIM-1087162

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references :

رقم السجل

BIM-1087162