An Embedded-Based Weighted Feature Selection Algorithm for Classifying Web Document

Joint Authors

Siva Shankar, G.
Ashokkumar, P.
Vinayakumar, R.
Ghosh, Uttam
Mansoor, Wathiq
Alnumay, Waleed S.

Source

Wireless Communications and Mobile Computing

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-09-15

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Information Technology and Computer Science

Abstract EN

With the exponential increase in a number of web pages daily, it makes it very difficult for a search engine to list relevant web pages.

In this paper, we propose a machine learning-based classification model that can learn the best features in each web page and helps in search engine listing.

The existing methods for listing have lots of drawbacks like interfacing the normal operations of the website and crawling lots of useless information.

Our proposed algorithm provides an optimal classification for websites which has a large number of web pages such as Wikipedia by just considering core information like link text, side information, and header text.

We implemented our algorithm with standard benchmark datasets, and the results show that our algorithm outperforms the existing algorithms.

American Psychological Association (APA)

Siva Shankar, G.& Ashokkumar, P.& Vinayakumar, R.& Ghosh, Uttam& Mansoor, Wathiq& Alnumay, Waleed S.. 2020. An Embedded-Based Weighted Feature Selection Algorithm for Classifying Web Document. Wireless Communications and Mobile Computing،Vol. 2020, no. 2020, pp.1-10.
https://search.emarefa.net/detail/BIM-1214845

Modern Language Association (MLA)

Siva Shankar, G.…[et al.]. An Embedded-Based Weighted Feature Selection Algorithm for Classifying Web Document. Wireless Communications and Mobile Computing No. 2020 (2020), pp.1-10.
https://search.emarefa.net/detail/BIM-1214845

American Medical Association (AMA)

Siva Shankar, G.& Ashokkumar, P.& Vinayakumar, R.& Ghosh, Uttam& Mansoor, Wathiq& Alnumay, Waleed S.. An Embedded-Based Weighted Feature Selection Algorithm for Classifying Web Document. Wireless Communications and Mobile Computing. 2020. Vol. 2020, no. 2020, pp.1-10.
https://search.emarefa.net/detail/BIM-1214845

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1214845