Identifying informative coronavirus tweets using recurrent neural network document embedding

العناوين الأخرى

تصنيف الأخبار المسهبة حول وباء الكورونا المستجد عبر تويتر باستخدام تضمين الملفات النصية بواسطة خوارزمية الشبكات العصبية المتكررة

تاريخ النشر

2022-03-31

دولة النشر

فلسطين (الضفة الغربية)

عدد الصفحات

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص AR

أدى ظهور جائحة كورونا إلى انتشار عدد هائل من أخبار كاذبة و معلومات مضللة عبر التغريدات، و من ثم فان هناك حاجة مهمة تتمثل في تصنيف التغريدات إلى تغريدات مسببة إعلاميا و غير مسببة إعلاميا و هذا الامر حفز الباحثين على استخدام تقنيات التعلم الآلي لمعالجة هذا الموضوع و قد أظهرت الدراسات الحديثة اعتمادا كبيرا على خوارزميات الشبكة العصبية المتحورة و مع ذلك فهي تعاني من مشكلة تدعى "النسيان الكارثي" حيث تفقد معلومات سياقية خلال عملية التدريب.

لذلك هذه الدراسة تهدف إلى اقتراح آلية لتضمين النصوص بناءا على الشبكة العصبية المتكررة أخيرا، تم استخدام ثلاثة مصنفات من LR و SVM و MLP لتصنيف المستندات إلى مسهبة إعلاميا و غير مسهبة إعلاميا، باستخدام مجموعة البيانات المعيارية ل 2020-WNUT في المهمة 2 ، حصل مصنف LR على أعلى مقياس يبلغ 0.91.

توضح هذه النتيجة فعالية RNN في توليد اليات تضمين نصوص مركبة.

الملخص EN

The coronavirus pandemic has led to the spread of tremendous fake news and misleading information through tweets.

hence, an interesting task of classifying tweets into informative and uninformative has motivated researchers to employ machine learning techniques.

the state-of-the-art studies showed high dependency on transformers architecture.

however, the transformers architecture suffers from the catastrophic forgetting problem where important contextual information is being forgotten by the gradients.

therefore, this paper proposes a document embedding using recurrent neural network.

Lastly, three classifiers of LR, SVM and MLP have been used to classify documents into informative and uninformative.

using the benchmark dataset of WNUT-2020 at Task 2, LR classifier obtained the highest f-measure of 0.91.

this result demonstrates the efficacy of RNN to generate sophisticated document embedding.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Yusuf, Rami Naim Muhammad. 2022. Identifying informative coronavirus tweets using recurrent neural network document embedding. Palestine Technical University Kadoorie Research Journal،Vol. 10, no. 1, pp.93-102.
https://search.emarefa.net/detail/BIM-1505794

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Yusuf, Rami Naim Muhammad. Identifying informative coronavirus tweets using recurrent neural network document embedding. Palestine Technical University Kadoorie Research Journal Vol. 10, no. 1 (2022), pp.93-102.
https://search.emarefa.net/detail/BIM-1505794

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Yusuf, Rami Naim Muhammad. Identifying informative coronavirus tweets using recurrent neural network document embedding. Palestine Technical University Kadoorie Research Journal. 2022. Vol. 10, no. 1, pp.93-102.
https://search.emarefa.net/detail/BIM-1505794

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 101-102

رقم السجل

BIM-1505794

حفظتم الحفظ طباعة

قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"

أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي

مرصد "معرفة"
لقياس الإنتاج العلمي العربي

تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر