The combination of text classification system
Other Title(s)
نظام الدمج في تصنيف النصوص
Joint Authors
Jawad, Zainab M.
Khalaf, Zaynab Ali
Source
Journal of Basrah Researches : Sciences
Issue
Vol. 46, Issue 1 (31 Mar. 2020), pp.90-99, 10 p.
Publisher
University of Basrah College of Education for Pure Sciences
Publication Date
2020-03-31
Country of Publication
Iraq
No. of Pages
10
Main Subjects
Topics
Abstract EN
Text classification is considered an important task for arranging text data under labels or categories.
Real life data varies in language, size, format, noise and area.
Consequently, these data need to be classified accurately due to their sensitivity and importance.
Despite there being different classifiers that could be used, any individual classifier is still affected by issues and does not produce the best result for all data.
Therefore, the researchers directed to use a system combination to enhance the prediction result.
In this paper, a combination system is used to avoid the variation in the classifiers' performance, by finding the best prediction among three classifiers (Naïve Bayes, Support Vector Machine, and K-Nearest Neighbors) before giving the final decision, to achieve the required reliability and the efficiency.
Four data collections are used in English and Arabic languages: 20-newsgroup, Reuters-21578, Watan-2004 and Khalaf-2018.
The proposed system produces better results than the individual classifiers.
The F1-scores for the proposed system were 95%, 93%, 94% and 92% for 20-newsgroup, Reuters-21578, Watan-2004 and Khalaf-2018, respectively
American Psychological Association (APA)
Jawad, Zainab M.& Khalaf, Zaynab Ali. 2020. The combination of text classification system. Journal of Basrah Researches : Sciences،Vol. 46, no. 1, pp.90-99.
https://search.emarefa.net/detail/BIM-973024
Modern Language Association (MLA)
Jawad, Zainab M.& Khalaf, Zaynab Ali. The combination of text classification system. Journal of Basrah Researches : Sciences Vol. 46, no. 1 (2020), pp.90-99.
https://search.emarefa.net/detail/BIM-973024
American Medical Association (AMA)
Jawad, Zainab M.& Khalaf, Zaynab Ali. The combination of text classification system. Journal of Basrah Researches : Sciences. 2020. Vol. 46, no. 1, pp.90-99.
https://search.emarefa.net/detail/BIM-973024
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references : p. 97-98
Record ID
BIM-973024