Identifying Palestinian political content from Arabic tweets

Other titles

تعريف المحتوى الفلسطيني السياسي من التغريدات العربية

Thesis author

al-Kurd, Husam Abd al-Raziq Ahmad

Thesis supervisor

Barakah, Ribhi Sulayman

Committee members

al-Halis, Ala Mustafa Darwish
al-Sayigh, Sana Wafa

University

Islamic University

Faculty

Faculty of Information Technology

University country

Palestine (Gaza Strip)

Degree

Master's

Degree date

2015

English abstract

The use of Twitter by news agencies and media has become widely popular, so a considerable proportion of tweets reflects social perspectives in the real world.

Furthermore, many people follow the news on Twitter, which motivates news agencies to analyze what is happening on Twitter.

Press and media agencies are looking for efficient tools to analyze and classify tweets because manual approaches are difficult and costly.

Various works have proposed effective solutions for converting tweets into machine-readable formats for classification and analysis.

While research on languages such as English has received much attention, languages such as Arabic have received little, despite the wide spread of Twitter in the Arab world in general and in Palestine in particular.

In this thesis, we propose a machine learning approach to automatically classify Arabic tweets related to Palestinian political content.

We focus on Palestinian political topics because Palestine receives great attention in Arab news and social media.

The approach is based on collecting tweets using an application that we develop based on TwitterPalPol.

It collects tweets from different Twitter APIs according to specific criteria such as keywords, region, and language.
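
The selection criteria named above (keywords, region, language) can be illustrated with a minimal sketch. It does not call any Twitter API: it assumes tweets have already been fetched into plain dictionaries. The field names `text`, `lang`, and `coords`, the keyword list, and the bounding box are all hypothetical choices made for illustration, not details taken from the thesis.

```python
from typing import Iterable, Tuple


def matches(tweet: dict, keywords: Iterable[str], lang: str,
            bbox: Tuple[float, float, float, float]) -> bool:
    """Return True if the tweet passes the language, keyword, and region checks."""
    # Language check: hypothetical 'lang' field holding a language code.
    if tweet.get("lang") != lang:
        return False
    # Keyword check: at least one keyword must occur in the text.
    text = tweet.get("text", "")
    if not any(keyword in text for keyword in keywords):
        return False
    # Region check: hypothetical 'coords' field holding (longitude, latitude).
    coords = tweet.get("coords")
    if coords is None:
        return False  # tweets without coordinates are dropped in this sketch
    west, south, east, north = bbox
    lon, lat = coords
    return west <= lon <= east and south <= lat <= north


# Toy records standing in for collected tweets (not real data).
tweets = [
    {"text": "أخبار غزة اليوم", "lang": "ar", "coords": (34.45, 31.50)},
    {"text": "Good morning", "lang": "en", "coords": (34.45, 31.50)},
    {"text": "مباراة كرة القدم اليوم", "lang": "ar", "coords": (34.45, 31.50)},
]
KEYWORDS = ["غزة", "فلسطين"]      # illustrative keyword list
BBOX = (34.0, 29.0, 36.0, 34.0)   # illustrative (west, south, east, north) box

selected = [t for t in tweets if matches(t, KEYWORDS, "ar", BBOX)]
```

Dropping tweets that lack coordinates is one possible design choice; a real collector might instead fall back on a profile location.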

Then we process the collected tweets and manually label part of them as either Palestinian political or not Palestinian political, to serve as the training data set for the selected machine learning algorithm.

The trained algorithm is then used to classify new tweets automatically.
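
The label-then-classify workflow can be sketched as follows. The abstract does not name the learning algorithm, so this example uses a small multinomial Naive Bayes classifier with Laplace smoothing purely as a stand-in. The class name, the whitespace tokenizer, the label strings, and the four training sentences are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text: str) -> list:
    """Split on whitespace; a real pipeline would also normalize Arabic text."""
    return text.split()


class NaiveBayes:
    """Multinomial Naive Bayes with Laplace (add-one) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)        # documents per class
        self.word_counts = defaultdict(Counter)    # word frequencies per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            score = math.log(n_docs / total_docs)  # log prior
            n_words = sum(self.word_counts[label].values())
            for word in tokenize(text):
                # Smoothed log likelihood of the word given the class.
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (n_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy manually labeled examples (hypothetical, not from the thesis data set).
train_texts = [
    "الانتخابات الفلسطينية القادمة",
    "الحكومة الفلسطينية تعقد اجتماعا",
    "مباراة كرة القدم اليوم",
    "الطقس مشمس اليوم",
]
train_labels = ["political", "political", "other", "other"]

model = NaiveBayes().fit(train_texts, train_labels)
new_label = model.predict("اجتماع الحكومة الفلسطينية")
```

With so few training sentences the model is only a demonstration of the mechanics; the thesis relies on a much larger manually labeled set.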

In addition, we create two data sets for learning: the first includes all the collected tweets prepared for learning, and the second includes only tweets that pass a credibility filter, which we created to evaluate the credibility of each tweet and discard fake tweets.

The filter depends on many factors related to tweet properties; we compare the classification results on both data sets to assess the importance of the filter.

The results were satisfactory, ranging between 80% and 97% on the main classification measures such as recall and precision.
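
For reference, precision and recall for the positive class can be computed from true and predicted labels as in the sketch below. The function name, the label strings, and the five toy labels are illustrative assumptions and do not reproduce the thesis results.

```python
def precision_recall(y_true, y_pred, positive):
    """Compute precision and recall for one class treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)  # true positives
    fp = sum(1 for t, p in pairs if p == positive and t != positive)  # false positives
    fn = sum(1 for t, p in pairs if p != positive and t == positive)  # false negatives
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Toy labels only: two correct positives, one false positive, one false negative.
y_true = ["political", "political", "political", "other", "other"]
y_pred = ["political", "political", "other", "political", "other"]

precision, recall = precision_recall(y_true, y_pred, positive="political")
```

Precision is the share of tweets predicted as political that really are political; recall is the share of truly political tweets that the classifier found.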

Main subjects

Information Technology and Computer Science

Number of pages

67

Table of contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One: Introduction.

Chapter Two: Related works.

Chapter Three: Theoretical and technical foundation.

Chapter Four: Approach for identifying political topics from tweets.

Chapter Five: Implementation and experiments.

Chapter Six: Results and evaluation.

Chapter Seven: Conclusion and future work.

References.

American Psychological Association (APA) citation style

al-Kurd, Husam Abd al-Raziq Ahmad. (2015). Identifying Palestinian political content from Arabic tweets. (Master's thesis). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-688696

Modern Language Association (MLA) citation style

al-Kurd, Husam Abd al-Raziq Ahmad. Identifying Palestinian political content from Arabic tweets. (Master's thesis). Islamic University. (2015).
https://search.emarefa.net/detail/BIM-688696

American Medical Association (AMA) citation style

al-Kurd, Husam Abd al-Raziq Ahmad. (2015). Identifying Palestinian political content from Arabic tweets. (Master's thesis). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-688696

Text language

English

Data type

Theses

Record number

BIM-688696