Intelligent documents classification system

Other Title(s)

نظام تصنيف الوثائق الذكي

Joint Authors

Hasan, Halah Diya
Abd Allah, Hasanayn Samir

Source

al-Mansour

Issue

Vol. 2019, Issue 31 (30 Jun. 2019), pp.134-151, 18 p.

Publisher

al-Mansour University College

Publication Date

2019-06-30

Country of Publication

Iraq

No. of Pages

18

Main Subjects

Information Technology and Computer Science

Topics

Abstract EN

There are a huge number of documents that available in many various sources in unorganized format, therefore these unstructured documents needs to be classified.

In this paper, a proposed system called "Intelligent Documents Classification System" which represents the system for classifying the documents to the correct class based on its textual information.

This system contain through four steps which are preprocessing, features extraction, proposed method for features selection, and finally, modify model of naïve bays.

Two datasets are used to evaluate the proposed system, the first dataset its name as "bbc from ucd repository" is standard that contains technical research documents distributed over five classes which available on the internet and the second dataset is collected dataset contains books documents distributed over six classes which collected during this work.

The IDC system achieved the powerful results.

For the standard dataset the accuracy is 95.1%, precision is 95%, recall is 95.8%, and f1-measure is 95.39% while the accuracy for the collected dataset is 95.3%, precision is 95.16%, recall is 95.83%, and f1-measure is 95.49%.

American Psychological Association (APA)

Abd Allah, Hasanayn Samir& Hasan, Halah Diya. 2019. Intelligent documents classification system. al-Mansour،Vol. 2019, no. 31, pp.134-151.
https://search.emarefa.net/detail/BIM-947689

Modern Language Association (MLA)

Abd Allah, Hasanayn Samir& Hasan, Halah Diya. Intelligent documents classification system. al-Mansour No. 31 (2019), pp.134-151.
https://search.emarefa.net/detail/BIM-947689

American Medical Association (AMA)

Abd Allah, Hasanayn Samir& Hasan, Halah Diya. Intelligent documents classification system. al-Mansour. 2019. Vol. 2019, no. 31, pp.134-151.
https://search.emarefa.net/detail/BIM-947689

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 149-150

Record ID

BIM-947689