Sentiment analysis based on probabilistic classifier techniques in various indonesian review data
Joint Authors
Hayatin, Nur
Alias, Thurayya
Hung, Lai Po
Sainin, Muhammad Shammari
Source
Jordanian Journal of Computetrs and Information Technology
Issue
Vol. 8, Issue 3 (30 Sep. 2022), pp.271-281, 11 p.
Publisher
Princess Sumaya University for Technology
Publication Date
2022-09-30
Country of Publication
Jordan
No. of Pages
11
Main Subjects
Information Technology and Computer Science
Abstract EN
Sentiment analysis is the field in data science to achieve a broader holistic view of users’ needs and expectations.
Indonesian user opinions have the potential to manage to be valuable information using sentiment-analysis tasks.
One of the most supervised-learning techniques used in Indonesian sentiment analysis is the Naïve Bayes classifier.
The classifier can be optimized and tuned in various models to increase the sentiment analysis model performance.
This research aims to examine the performance of various Naïve Bayes models in sentiment analysis, especially when implemented in small datasets to handle overfitting problems.
Four different Naïve Bayes models used are Gaussian, Multinomial, Complement and Bernoulli.
We also analyze the effect of various pre-processing techniques on the models’ performance.
Moreover, we build the first fashion dataset from the Indonesian marketplace which has a unique character compared to the datasets from other domains.
Finally, we also use various datasets in the experiment to test the Naïve Bayes models' performance.
From the experimental results, Complement Naïve Bayes is superior to other models, especially in handling overfitting with an F1-score of approximately 0.82.
American Psychological Association (APA)
Hayatin, Nur& Alias, Thurayya& Hung, Lai Po& Sainin, Muhammad Shammari. 2022. Sentiment analysis based on probabilistic classifier techniques in various indonesian review data. Jordanian Journal of Computetrs and Information Technology،Vol. 8, no. 3, pp.271-281.
https://search.emarefa.net/detail/BIM-1435888
Modern Language Association (MLA)
Hayatin, Nur…[et al.]. Sentiment analysis based on probabilistic classifier techniques in various indonesian review data. Jordanian Journal of Computetrs and Information Technology Vol. 8, no. 3 (Sep. 2022), pp.271-281.
https://search.emarefa.net/detail/BIM-1435888
American Medical Association (AMA)
Hayatin, Nur& Alias, Thurayya& Hung, Lai Po& Sainin, Muhammad Shammari. Sentiment analysis based on probabilistic classifier techniques in various indonesian review data. Jordanian Journal of Computetrs and Information Technology. 2022. Vol. 8, no. 3, pp.271-281.
https://search.emarefa.net/detail/BIM-1435888
Data Type
Journal Articles
Language
English
Notes
Includes margin notes.
Record ID
BIM-1435888