Arabic opinion mining using parallel decision trees

Other Title(s)

التنقيب عن الآراء العربية باستخدام شجرة القرار المتوازية

Dissertant

Ahmad, Wafa Ala al-Din Mahmud

Thesis advisor

al-Halis, Ala Mustafa

Comitee Members

Radi, Muhammad Abd al-Latif
Awad Allah, Riwayah Fawzi

University

Islamic University

Faculty

Faculty of Information Technology

University Country

Palestine (Gaza Strip)

Degree

Master

Degree Date

2014

English Abstract

With the popularity of online shopping it is increasingly becoming important for manufacturers and service providers to ask customers to review their product and associated service.

Similarly, the number of customer reviews that a product receives grows rapidly and can be in hundreds or even thousands.

This makes it difficult for a potential customer to decide whether or not to buy the product.

It is also difficult for the manufacturer of the product to keep track and manage customer opinions.

Hence the importance stemmed opinion mining which is an emerging area of research, that summarizes the customer reviews of a product or service and express whether the opinions are positive or negative.

Various methods have been proposed as classifiers for opinion mining such as Naïve Bayesian, k-Nearest Neighbor techniques, and Support vector machine, the main drawback of these methods is classifying opinion without giving us the reasons about why the instance opinion is classified to certain class.

Therefore, in our work, we investigate opinion mining of Arabic text at the document level, by applying decision trees classification method to have clear, understandable rules.

In addition, we apply parallel decision trees classifiers to have efficient results.

We applied parallel decision trees on two Arabic corpus BHA and OCA of text.

To generate text representations, we apply some preprocessing operators such as Tokenize , filters Arabic stopwords, Stem Arabic, filters tokens based on their length, and filters tokens based on their content to exclude English words.

In case of applying parallel decision tree family on OCA, we get the best results of accuracy (93.83%) , f-measure (93.22) and consumed time 42 Sec at thread 4, which is greater than sequential that have accuracy (92.59%) and f-measure (92.58), and consumed time 68 Sec.

In case of applying parallel decision tree family on BHA we get the best results of accuracy (90.63%) , f-measure (82.29)and consumed time 219 Sec at thread 4, these results are different from sequential that have accuracy (90.70%) and f-measure (90.94), and consumed time 417 Sec.

Main Subjects

Information Technology and Computer Science

Topics

No. of Pages

76

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Related works.

Chapter Three : Arabic opinion mining and classification.

Chapter Four : Text data collection and preprocessing a collection and preprocessing.

Chapter Five : Experimental results and evaluation.

Chapter Six : Conclusion and future works.

References.

American Psychological Association (APA)

Ahmad, Wafa Ala al-Din Mahmud. (2014). Arabic opinion mining using parallel decision trees. (Master's theses Theses and Dissertations Master). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-688598

Modern Language Association (MLA)

Ahmad, Wafa Ala al-Din Mahmud. Arabic opinion mining using parallel decision trees. (Master's theses Theses and Dissertations Master). Islamic University. (2014).
https://search.emarefa.net/detail/BIM-688598

American Medical Association (AMA)

Ahmad, Wafa Ala al-Din Mahmud. (2014). Arabic opinion mining using parallel decision trees. (Master's theses Theses and Dissertations Master). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-688598

Language

English

Data Type

Arab Theses

Record ID

BIM-688598