An Ant Colony Optimization Based Feature Selection for Web Page Classification

Joint Authors

Saraç, Esra
Özel, Selma Ayşe

Source

The Scientific World Journal

Issue

Vol. 2014, Issue 2014 (31 Dec. 2014), pp.1-16, 16 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2014-07-17

Country of Publication

Egypt

No. of Pages

16

Main Subjects

Medicine
Information Technology and Computer Science

Abstract EN

The increased popularity of the web has caused the inclusion of huge amount of information to the web, and as a result of this explosive information growth, automated web page classification systems are needed to improve search engines’ performance.

Web pages have a large number of features such as HTML/XML tags, URLs, hyperlinks, and text contents that should be considered during an automated classification process.

The aim of this study is to reduce the number of features to be used to improve runtime and accuracy of the classification of web pages.

In this study, we used an ant colony optimization (ACO) algorithm to select the best features, and then we applied the well-known C4.5, naive Bayes, and k nearest neighbor classifiers to assign class labels to web pages.

We used the WebKB and Conference datasets in our experiments, and we showed that using the ACO for feature selection improves both accuracy and runtime performance of classification.

We also showed that the proposed ACO based algorithm can select better features with respect to the well-known information gain and chi square feature selection methods.

American Psychological Association (APA)

Saraç, Esra& Özel, Selma Ayşe. 2014. An Ant Colony Optimization Based Feature Selection for Web Page Classification. The Scientific World Journal،Vol. 2014, no. 2014, pp.1-16.
https://search.emarefa.net/detail/BIM-1050501

Modern Language Association (MLA)

Saraç, Esra& Özel, Selma Ayşe. An Ant Colony Optimization Based Feature Selection for Web Page Classification. The Scientific World Journal No. 2014 (2014), pp.1-16.
https://search.emarefa.net/detail/BIM-1050501

American Medical Association (AMA)

Saraç, Esra& Özel, Selma Ayşe. An Ant Colony Optimization Based Feature Selection for Web Page Classification. The Scientific World Journal. 2014. Vol. 2014, no. 2014, pp.1-16.
https://search.emarefa.net/detail/BIM-1050501

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1050501