Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics

المؤلفون المشاركون

Gul, Muhammad Adnan
Uddin, M. Irfan
Khan, Atif
Shah, Syed Atif Ali
Ahmad, Shafiq
Al Firdausi, Muhammad Dzulqarnain
Zaindin, Mazen

المصدر

Scientific Programming

العدد

المجلد 2020، العدد 2020 (31 ديسمبر/كانون الأول 2020)، ص ص. 1-14، 14ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2020-08-01

دولة النشر

مصر

عدد الصفحات

14

التخصصات الرئيسية

الرياضيات

الملخص EN

Information is exploding on the web at exponential pace, so online movie review is becoming a substantial information resource for online users.

However, users post millions of movie reviews on regular basis, and it is not possible for users to summarize the reviews.

Movie review classification and summarization is one of the challenging tasks in natural language processing.

Therefore, an automatic approach is demanded to summarize the vast amount of movie reviews, and it will allow the users to speedily distinguish the positive and negative aspects of a movie.

This study has proposed an approach for movie review classification and summarization.

For movie review classification, bag-of-words feature extraction technique is used to extract unigrams, bigrams, and trigrams as a feature set from given review documents, and represent the review documents as a vector space model.

Next, the Naïve Bayes algorithm is employed to classify the movie reviews (represented as a feature vector) into positive and negative reviews.

For the task of movie review summarization, Word2vec feature extraction technique is used to extract features from classified movie review sentences, and then semantic clustering technique is used to cluster semantically related review sentences.

Different text features are used to calculate the salience score of each review sentence in clusters.

Finally, the top-ranked sentences are chosen based on highest salience scores to produce the extractive summary of movie reviews.

Experimental results reveal that the proposed machine learning approach is superior than other state-of-the-art approaches.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Khan, Atif& Gul, Muhammad Adnan& Uddin, M. Irfan& Shah, Syed Atif Ali& Ahmad, Shafiq& Al Firdausi, Muhammad Dzulqarnain…[et al.]. 2020. Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics. Scientific Programming،Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1209052

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Khan, Atif…[et al.]. Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics. Scientific Programming No. 2020 (2020), pp.1-14.
https://search.emarefa.net/detail/BIM-1209052

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Khan, Atif& Gul, Muhammad Adnan& Uddin, M. Irfan& Shah, Syed Atif Ali& Ahmad, Shafiq& Al Firdausi, Muhammad Dzulqarnain…[et al.]. Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics. Scientific Programming. 2020. Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1209052

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1209052