Resource optimization in automatic web page classification using integrated feature selection and machine learning

المؤلفون المشاركون

Ramasamy, Rajaram
Mahadevan, Indra
Karuppasamy, Selvakuberan

المصدر

International Arab Journal of E-Technology

العدد

المجلد 1، العدد 1 (31 يناير/كانون الثاني 2009)10ص.

الناشر

الجامعة العربية المفتوحة

تاريخ النشر

2009-01-31

دولة النشر

الأردن

عدد الصفحات

10

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

الملخص EN

Increasing with the number of users, the need for automatic classification techniques with good classification accuracy increases as search engines depend on previously classified web pages stored in classified directories to retrieve the relevant results.

Preprocessing is the important step in web page classification problem as most of the web pages contain more irrelevant information than relevant details useful for finding its category.

For the classification purpose the representative words in the web page called as features are used for classification rather than the entire web page to reduce time and space requirements.

The selection of relevant features which reduces the high dimensionality and redundancy is the current research topic.

In this paper selecting relevant features from the pages is treated as an optimization problem and we propose an algorithm to find the optimal features for web page classification.

Machine learning techniques for automatic classification gains more interest as the classifier improves its performan.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Mahadevan, Indra& Karuppasamy, Selvakuberan& Ramasamy, Rajaram. 2009. Resource optimization in automatic web page classification using integrated feature selection and machine learning. International Arab Journal of E-Technology،Vol. 1, no. 1.
https://search.emarefa.net/detail/BIM-38680

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Mahadevan, Indra…[et al.]. Resource optimization in automatic web page classification using integrated feature selection and machine learning. International Arab Journal of E-Technology Vol. 1, no. 1 (Jan. 2009).
https://search.emarefa.net/detail/BIM-38680

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Mahadevan, Indra& Karuppasamy, Selvakuberan& Ramasamy, Rajaram. Resource optimization in automatic web page classification using integrated feature selection and machine learning. International Arab Journal of E-Technology. 2009. Vol. 1, no. 1.
https://search.emarefa.net/detail/BIM-38680

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-38680