Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics

Joint Authors

Gul, Muhammad Adnan
Uddin, M. Irfan
Khan, Atif
Shah, Syed Atif Ali
Ahmad, Shafiq
Al Firdausi, Muhammad Dzulqarnain
Zaindin, Mazen

Source

Scientific Programming

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-14, 14 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-08-01

Country of Publication

Egypt

No. of Pages

14

Main Subjects

Mathematics

Abstract EN

Information is exploding on the web at exponential pace, so online movie review is becoming a substantial information resource for online users.

However, users post millions of movie reviews on regular basis, and it is not possible for users to summarize the reviews.

Movie review classification and summarization is one of the challenging tasks in natural language processing.

Therefore, an automatic approach is demanded to summarize the vast amount of movie reviews, and it will allow the users to speedily distinguish the positive and negative aspects of a movie.

This study has proposed an approach for movie review classification and summarization.

For movie review classification, bag-of-words feature extraction technique is used to extract unigrams, bigrams, and trigrams as a feature set from given review documents, and represent the review documents as a vector space model.

Next, the Naïve Bayes algorithm is employed to classify the movie reviews (represented as a feature vector) into positive and negative reviews.

For the task of movie review summarization, Word2vec feature extraction technique is used to extract features from classified movie review sentences, and then semantic clustering technique is used to cluster semantically related review sentences.

Different text features are used to calculate the salience score of each review sentence in clusters.

Finally, the top-ranked sentences are chosen based on highest salience scores to produce the extractive summary of movie reviews.

Experimental results reveal that the proposed machine learning approach is superior than other state-of-the-art approaches.

American Psychological Association (APA)

Khan, Atif& Gul, Muhammad Adnan& Uddin, M. Irfan& Shah, Syed Atif Ali& Ahmad, Shafiq& Al Firdausi, Muhammad Dzulqarnain…[et al.]. 2020. Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics. Scientific Programming،Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1209052

Modern Language Association (MLA)

Khan, Atif…[et al.]. Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics. Scientific Programming No. 2020 (2020), pp.1-14.
https://search.emarefa.net/detail/BIM-1209052

American Medical Association (AMA)

Khan, Atif& Gul, Muhammad Adnan& Uddin, M. Irfan& Shah, Syed Atif Ali& Ahmad, Shafiq& Al Firdausi, Muhammad Dzulqarnain…[et al.]. Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics. Scientific Programming. 2020. Vol. 2020, no. 2020, pp.1-14.
https://search.emarefa.net/detail/BIM-1209052

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1209052