Improved Feature Weight Algorithm and Its Application to Text Classification
Joint Authors
Shang, Songtao
Shi, Minyong
Shang, Wenqian
Hong, Zhiguo
Source
Mathematical Problems in Engineering
Issue
Vol. 2016, Issue 2016 (31 Dec. 2016), pp.1-12, 12 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2016-03-28
Country of Publication
Egypt
No. of Pages
12
Main Subjects
Abstract EN
Text preprocessing is one of the key problems in pattern recognition and plays an important role in the process of text classification.
Text preprocessing has two pivotal steps: feature selection and feature weighting.
The preprocessing results can directly affect the classifiers’ accuracy and performance.
Therefore, choosing the appropriate algorithm for feature selection and feature weighting to preprocess the document can greatly improve the performance of classifiers.
According to the Gini Index theory, this paper proposes an Improved Gini Index algorithm.
This algorithm constructs a new feature selection and feature weighting function.
The experimental results show that this algorithm can improve the classifiers’ performance effectively.
At the same time, this algorithm is applied to a sensitive information identification system and has achieved a good result.
The algorithm’s precision and recall are higher than those of traditional ones.
It can identify sensitive information on the Internet effectively.
American Psychological Association (APA)
Shang, Songtao& Shi, Minyong& Shang, Wenqian& Hong, Zhiguo. 2016. Improved Feature Weight Algorithm and Its Application to Text Classification. Mathematical Problems in Engineering،Vol. 2016, no. 2016, pp.1-12.
https://search.emarefa.net/detail/BIM-1112608
Modern Language Association (MLA)
Shang, Songtao…[et al.]. Improved Feature Weight Algorithm and Its Application to Text Classification. Mathematical Problems in Engineering No. 2016 (2016), pp.1-12.
https://search.emarefa.net/detail/BIM-1112608
American Medical Association (AMA)
Shang, Songtao& Shi, Minyong& Shang, Wenqian& Hong, Zhiguo. Improved Feature Weight Algorithm and Its Application to Text Classification. Mathematical Problems in Engineering. 2016. Vol. 2016, no. 2016, pp.1-12.
https://search.emarefa.net/detail/BIM-1112608
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1112608