The combination of text classification system

Other Title(s)

نظام الدمج في تصنيف النصوص

Joint Authors

Jawad, Zainab M.
Khalaf, Zaynab Ali

Source

Journal of Basrah Researches : Sciences

Issue

Vol. 46, Issue 1 (31 Mar. 2020), pp.90-99, 10 p.

Publisher

University of Basrah College of Education for Pure Sciences

Publication Date

2020-03-31

Country of Publication

Iraq

No. of Pages

10

Main Subjects

Mathematics

Topics

Abstract EN

Text classification is considered an important task for arranging text data under labels or categories.

Real life data varies in language, size, format, noise and area.

Consequently, these data need to be classified accurately due to their sensitivity and importance.

Despite there being different classifiers that could be used, any individual classifier is still affected by issues and does not produce the best result for all data.

Therefore, the researchers directed to use a system combination to enhance the prediction result.

In this paper, a combination system is used to avoid the variation in the classifiers' performance, by finding the best prediction among three classifiers (Naïve Bayes, Support Vector Machine, and K-Nearest Neighbors) before giving the final decision, to achieve the required reliability and the efficiency.

Four data collections are used in English and Arabic languages: 20-newsgroup, Reuters-21578, Watan-2004 and Khalaf-2018.

The proposed system produces better results than the individual classifiers.

The F1-scores for the proposed system were 95%, 93%, 94% and 92% for 20-newsgroup, Reuters-21578, Watan-2004 and Khalaf-2018, respectively

American Psychological Association (APA)

Jawad, Zainab M.& Khalaf, Zaynab Ali. 2020. The combination of text classification system. Journal of Basrah Researches : Sciences،Vol. 46, no. 1, pp.90-99.
https://search.emarefa.net/detail/BIM-973024

Modern Language Association (MLA)

Jawad, Zainab M.& Khalaf, Zaynab Ali. The combination of text classification system. Journal of Basrah Researches : Sciences Vol. 46, no. 1 (2020), pp.90-99.
https://search.emarefa.net/detail/BIM-973024

American Medical Association (AMA)

Jawad, Zainab M.& Khalaf, Zaynab Ali. The combination of text classification system. Journal of Basrah Researches : Sciences. 2020. Vol. 46, no. 1, pp.90-99.
https://search.emarefa.net/detail/BIM-973024

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 97-98

Record ID

BIM-973024