Automatic linking of short Arabic text to Wikipedia articles

العناوين الأخرى

الربط التلقائي للنصوص العربية القصيرة مع مقالات الويكيبيديا

مقدم أطروحة جامعية

Fayad, Fatum M. A.

مشرف أطروحة جامعية

al-Agha, Iyad M.

الجامعة

الجامعة الإسلامية

الكلية

كلية تكنولوجيا المعلومات

القسم الأكاديمي

تكنولوجيا المعلومات

دولة الجامعة

فلسطين (قطاع غزة)

الدرجة العلمية

ماجستير

تاريخ الدرجة العلمية

2016

الملخص الإنجليزي

Given the enormous amount of unstructured texts available on the Web, there has been an emerging need to increase discoverability of and accessibility to these texts.

One of the proposed solutions is to annotate texts with information extracted from background knowledge.

Wikipedia, the free encyclopedia, has been recently exploited as a background knowledge to dynamically annotate text with complementary information.

Given any piece of text the main challenge is how to determine the most relevant information from Wikipedia with the least effort and time.

While Wikipedia-based annotation has mainly targeted the English and Latin versions of Wikipedia, little effort has been devoted to annotate Arabic text using the Arabic version of Wikipedia.

This work proposes an approach for dynamic linking of Arabic short texts to Wikipedia articles.

It reports on the several challenges associated with the design and implementation of the linking approach including the processing and setting up of the Wikipedia's enormous content, the mapping of texts to Wikipedia articles, the problem of article disambiguation, and time efficiency.

The proposed approach focuses on short texts because they are generally more difficult to process and annotate than long texts.

The proposed approach was assessed over a dataset of 100 short texts gathered from online Arabic articles and then work on it offline.

Hyperlinks generated by the approach were compared with the hyperlinks generated by two human raters.

The dynamic linking approach achieved 71.79% accuracy, 74.70% average precision, and 82.63 % average recall.

A thorough analysis and discussion of the evaluation results are also presented to address the limitations and strengths as well as the recommendations for future improvements.

The source code, dataset, and complete experimental results are made available online on: https://github.com/FatoomMFayad/Dynamic-Linking

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

عدد الصفحات

89

قائمة المحتويات

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Literature review.

Chapter Three : Methodology.

Chapter Four : Results and discussion.

Chapter Five : Conclusions.

References.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Fayad, Fatum M. A.. (2016). Automatic linking of short Arabic text to Wikipedia articles. (Master's theses Theses and Dissertations Master). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-727248

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Fayad, Fatum M. A.. Automatic linking of short Arabic text to Wikipedia articles. (Master's theses Theses and Dissertations Master). Islamic University. (2016).
https://search.emarefa.net/detail/BIM-727248

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Fayad, Fatum M. A.. (2016). Automatic linking of short Arabic text to Wikipedia articles. (Master's theses Theses and Dissertations Master). Islamic University, Palestine (Gaza Strip)
https://search.emarefa.net/detail/BIM-727248

لغة النص

الإنجليزية

نوع البيانات

رسائل جامعية

رقم السجل

BIM-727248