The semantic similarity measures using Arabic ontology

Other Title(s)

مقاييس التشابه الدلالي باستخدام الانتولوجي العربي

Dissertant

al-Dayri, Muhammad Ghandi

Thesis advisor

Kayid, Ahmad K. A.

Comitee Members

al-Sadi, Jihad
Abu Shurayhah, Ahmad

University

Middle East University

Faculty

Faculty of Information Technology

Department

Computer Science Department

University Country

Jordan

Degree

Master

Degree Date

2017

English Abstract

The semantic similarity measures have been used in many applications including information retrieval and natural language processing.

There are many measures that use a lexical database such as WordNet to calculate the similarity between English concepts.

However, few researches have been studied semantic similarity measures using Arabic WordNet.

The traditional semantic similarity measures were classified into four categories: path-based measures, information content-based, feature-based measures, and hybrid measures.

Several measures from different categories have been applied on Arabic WordNet to which measure has the best performance using Arabic WordNet.

Human benchmark has been used to evaluate the performance of these measures over Arabic WordNet.

Experimental results show that the WuP measure has achieved the minimum mean square error (MSE) with value of (1.64%), and highest value of correlation coefficient with human ratings (0.92).

These results indicate that WuP measure has the best performance on Arabic WordNet compared to other measures.

Also, the results show that PATH measure has the worst performance.

This thesis proposed a new semantic similarity measure using the taxonomy of Arabic WordNet.

The new measure takes three factors into account: depth of concepts in Arabic WordNet tree, distance between two compared concepts and information content of the least common concept that subsumed two compared concepts.

The weight of these factors can be adapted manually.

However, several experiments have been conducted to find the best weight that achieves the minimum MSE.

In order to evaluate the new measure, the Arabic dataset that used previously to evaluate the measures has been used to test the new measure.

Then, the results of applying new measure over Arabic WordNet have been compared with the results of the other measures.

However, the results showed that the new measure has achieved the highest correlation coefficient with human ratings (0.96), furthermore, the new measure has obtained a very good value of MSE (1.89%) compared with the other measures

Main Subjects

Information Technology and Computer Science

Topics

No. of Pages

87

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Literature review and related works.

Chapter Three : Experimental work and new proposed measure.

Chapter Four : Experimental results and measures evaluation.

Chapter Five : Conclusions and future work.

References.

American Psychological Association (APA)

al-Dayri, Muhammad Ghandi. (2017). The semantic similarity measures using Arabic ontology. (Master's theses Theses and Dissertations Master). Middle East University, Jordan
https://search.emarefa.net/detail/BIM-762685

Modern Language Association (MLA)

al-Dayri, Muhammad Ghandi. The semantic similarity measures using Arabic ontology. (Master's theses Theses and Dissertations Master). Middle East University. (2017).
https://search.emarefa.net/detail/BIM-762685

American Medical Association (AMA)

al-Dayri, Muhammad Ghandi. (2017). The semantic similarity measures using Arabic ontology. (Master's theses Theses and Dissertations Master). Middle East University, Jordan
https://search.emarefa.net/detail/BIM-762685

Language

English

Data Type

Arab Theses

Record ID

BIM-762685