Hybrid support vector machine based feature selection method for text classification

Joint Authors

Ayyash, Musab
Sabah, Thabit
Ashraf, Mahmud

Source

The International Arab Journal of Information Technology

Publisher

Zarqa University

Publication Date

2018-05-31

Country of Publication

Jordan

No. of Pages

11

Main Subjects

Information Technology and Computer Science

English Abstract

Automatic text classification is an effective solution used to sort out the increasing amount of online textual content.

However, high dimensionality is a considerable impediment observed in the text classification field in spite of the fact that there have been many statistical methods available to address this issue.

Still, none of these has proved to be effective enough in solving this problem.

This paper proposes a machine learning based feature ranking and selection method named Support Vector Machine based Feature Ranking Method (SVM-FRM).

The proposed method utilizes Support Vector Machine (SVM) learning algorithm for weighting and selecting the significant features in order to obtain better classification performance.

Later on, hybridization techniques are applied to enhance the performance of SVM-FRM method in some experimental situations.

The proposed SVM-FRM method and its enhancement are tested using three text classification public datasets.

The achieved results are compared with other statistical feature selection methods currently used for the said purpose.

Results evaluation shows higher and superior F-measure and accuracy performances of the proposed SVM-FRM on balanced datasets.

Moreover, a noticeable performance enhancement is recorded due to the application of the proposed hybridization techniques on an unbalanced dataset.

Data Type

Conference Papers

Record ID

BIM-896588

American Psychological Association (APA)

Sabah, Thabit& Ashraf, Mahmud& Ayyash, Musab. 2018-05-31. Hybrid support vector machine based feature selection method for text classification. International Arab Conference on Information Technology (18 : 2017 : Zarqa, Jordan). . Vol. 15, no. 3A (Special issue) (2018), pp.599-609.Zarqa Jordan : Zarqa University.
https://search.emarefa.net/detail/BIM-896588

Modern Language Association (MLA)

Sabah, Thabit…[et al.]. Hybrid support vector machine based feature selection method for text classification. . Zarqa Jordan : Zarqa University. 2018-05-31.
https://search.emarefa.net/detail/BIM-896588

American Medical Association (AMA)

Sabah, Thabit& Ashraf, Mahmud& Ayyash, Musab. Hybrid support vector machine based feature selection method for text classification. . International Arab Conference on Information Technology (18 : 2017 : Zarqa, Jordan).
https://search.emarefa.net/detail/BIM-896588