Arabic opinion mining using parallel decision trees
العناوين الأخرى
التنقيب عن الآراء العربية باستخدام شجرة القرار المتوازية
مقدم أطروحة جامعية
مشرف أطروحة جامعية
أعضاء اللجنة
Radi, Muhammad Abd al-Latif
Awad Allah, Riwayah Fawzi
الجامعة
الجامعة الإسلامية
الكلية
كلية تكنولوجيا المعلومات
دولة الجامعة
فلسطين (قطاع غزة)
الدرجة العلمية
ماجستير
تاريخ الدرجة العلمية
2014
الملخص الإنجليزي
With the popularity of online shopping it is increasingly becoming important for manufacturers and service providers to ask customers to review their product and associated service.
Similarly, the number of customer reviews that a product receives grows rapidly and can be in hundreds or even thousands.
This makes it difficult for a potential customer to decide whether or not to buy the product.
It is also difficult for the manufacturer of the product to keep track and manage customer opinions.
Hence the importance stemmed opinion mining which is an emerging area of research, that summarizes the customer reviews of a product or service and express whether the opinions are positive or negative.
Various methods have been proposed as classifiers for opinion mining such as Naïve Bayesian, k-Nearest Neighbor techniques, and Support vector machine, the main drawback of these methods is classifying opinion without giving us the reasons about why the instance opinion is classified to certain class.
Therefore, in our work, we investigate opinion mining of Arabic text at the document level, by applying decision trees classification method to have clear, understandable rules.
In addition, we apply parallel decision trees classifiers to have efficient results.
We applied parallel decision trees on two Arabic corpus BHA and OCA of text.
To generate text representations, we apply some preprocessing operators such as Tokenize , filters Arabic stopwords, Stem Arabic, filters tokens based on their length, and filters tokens based on their content to exclude English words.
In case of applying parallel decision tree family on OCA, we get the best results of accuracy (93.83%) , f-measure (93.22) and consumed time 42 Sec at thread 4, which is greater than sequential that have accuracy (92.59%) and f-measure (92.58), and consumed time 68 Sec.
In case of applying parallel decision tree family on BHA we get the best results of accuracy (90.63%) , f-measure (82.29)and consumed time 219 Sec at thread 4, these results are different from sequential that have accuracy (90.70%) and f-measure (90.94), and consumed time 417 Sec.
التخصصات الرئيسية
تكنولوجيا المعلومات وعلم الحاسوب
الموضوعات
عدد الصفحات
76
قائمة المحتويات
Table of contents.
Abstract.
Abstract in Arabic.
Chapter One : Introduction.
Chapter Two : Related works.
Chapter Three : Arabic opinion mining and classification.
Chapter Four : Text data collection and preprocessing a collection and preprocessing.
Chapter Five : Experimental results and evaluation.
Chapter Six : Conclusion and future works.
References.
نمط استشهاد جمعية علماء النفس الأمريكية (APA)
Ahmad, Wafa Ala al-Din Mahmud. (2014). Arabic opinion mining using parallel decision trees. (Master's theses Theses and Dissertations Master). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-688598
نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)
Ahmad, Wafa Ala al-Din Mahmud. Arabic opinion mining using parallel decision trees. (Master's theses Theses and Dissertations Master). Islamic University. (2014).
https://search.emarefa.net/detail/BIM-688598
نمط استشهاد الجمعية الطبية الأمريكية (AMA)
Ahmad, Wafa Ala al-Din Mahmud. (2014). Arabic opinion mining using parallel decision trees. (Master's theses Theses and Dissertations Master). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-688598
لغة النص
الإنجليزية
نوع البيانات
رسائل جامعية
رقم السجل
BIM-688598
قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"
أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي
تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر