Investigating the efficiency of wordnet as background knowledge for document clustering

المؤلفون المشاركون

Nafi, Rami
al-agah, Iyad

المصدر

Journal of Engineering Research and Technology

العدد

المجلد 2، العدد 2 (30 يونيو/حزيران 2015)، ص ص. 152-158، 7ص.

الناشر

الجامعة الإسلامية-غزة عمادة شؤون البحث العلمي و الدراسات العليا

تاريخ النشر

2015-06-30

دولة النشر

فلسطين (قطاع غزة)

عدد الصفحات

7

التخصصات الرئيسية

العلوم الاقتصادية والمالية وإدارة الأعمال

الملخص EN

Traditional techniques of document clustering do not consider the semantic relationships between words when assigning documents to clusters.

For instance, if two documents talk about the same topic but by using different words, these techniques may assign documents to different clusters.

Many efforts have approached this problem by enriching the document’s representation with background knowledge from WordNet.

These efforts, however, often showed conflicting results: While some researches claimed that WordNet had the potential to improve the clustering performance by its capability to capture and estimate similarities between words, other researches claimed that WordNet provided little or no enhancement to the obtained clusters.

This work aims to experimentally resolve this contradiction between the two teams, and explain why WordNet could be useful in some cases while not in others, and what factors can influence the use of WordNet for document clustering.

We conducted a set of experiments in which WordNet was used for document clustering with various settings including different datasets, different ways of incorporating semantics into the document’s representation and different similarity measures.

Results showed that different experimental settings may yield different clusters: For example, the influence of WordNet’s semantic features varies according to the dataset being used.

Results also revealed that WordNet-based similarity measures do not seem to improve clustering, and that there was no certain measure to ensure the best clustering results.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

al-agah, Iyad& Nafi, Rami. 2015. Investigating the efficiency of wordnet as background knowledge for document clustering. Journal of Engineering Research and Technology،Vol. 2, no. 2, pp.152-158.
https://search.emarefa.net/detail/BIM-717483

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

al-agah, Iyad& Nafi, Rami. Investigating the efficiency of wordnet as background knowledge for document clustering. Journal of Engineering Research and Technology Vol. 2, no. 2 (Jun. 2015), pp.152-158.
https://search.emarefa.net/detail/BIM-717483

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

al-agah, Iyad& Nafi, Rami. Investigating the efficiency of wordnet as background knowledge for document clustering. Journal of Engineering Research and Technology. 2015. Vol. 2, no. 2, pp.152-158.
https://search.emarefa.net/detail/BIM-717483

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 157-158

رقم السجل

BIM-717483