An efficient web search engine for noisy free information retrieval

المؤلفون المشاركون

Sahoo, Pradeep
Parthasarthy, Rajagopalan

المصدر

The International Arab Journal of Information Technology

العدد

المجلد 15، العدد 3 (31 مايو/أيار 2018)7ص.

الناشر

جامعة الزرقاء

تاريخ النشر

2018-05-31

دولة النشر

الأردن

عدد الصفحات

7

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

The vast growth, various dynamic and low quality of the World Wide Web makes it very difficult to retrieve relevant information from internet during query search.

To resolve this issue, various web mining techniques are being used.

The biggest challenge in web mining is to remove noisy data information or unwanted information from the webpage such as banner, video, audio, images, hyperlinks etc.

which are not associated to a user query.

To overcome these issues, a novel custom search engine is proposed with efficient algorithm in this paper.

The proposed Uniform Resource Locator (URL) pattern extractor algorithm will extract the all relevance index pages from the web and ranking the indexes based on user query.

Then, Noisy Data Cleaner (NDC) algorithm is applied to remove the unwanted content from the retrieved web pages.

The results show that the proposed UPE+NDC algorithm provides very promising results for different datasets with high precision and recall rate in comparison with the existing algorithms.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Sahoo, Pradeep& Parthasarthy, Rajagopalan. 2018. An efficient web search engine for noisy free information retrieval. The International Arab Journal of Information Technology،Vol. 15, no. 3.
https://search.emarefa.net/detail/BIM-839230

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Sahoo, Pradeep& Parthasarthy, Rajagopalan. An efficient web search engine for noisy free information retrieval. The International Arab Journal of Information Technology Vol. 15, no. 3 (May. 2018).
https://search.emarefa.net/detail/BIM-839230

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Sahoo, Pradeep& Parthasarthy, Rajagopalan. An efficient web search engine for noisy free information retrieval. The International Arab Journal of Information Technology. 2018. Vol. 15, no. 3.
https://search.emarefa.net/detail/BIM-839230

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-839230