A semantic framework for extracting taxonomic relations from text corpus

المؤلفون المشاركون

Doan, Phuoc Thi Hong
Arch Int, Ngamnij
Arch Int, Somjit

المصدر

The International Arab Journal of Information Technology

العدد

المجلد 17، العدد 3 (31 مايو/أيار 2020)، ص ص. 325-337، 13ص.

الناشر

جامعة الزرقاء عمادة البحث العلمي

تاريخ النشر

2020-05-31

دولة النشر

الأردن

عدد الصفحات

13

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

الملخص EN

Nowadays, ontologies have been exploited in many current applications due to the abilities in representing knowledge and inferring new knowledge.

However, the manual construction of ontologies is tedious and time-consuming.

Therefore, the automated ontology construction from text has been investigated.

The extraction of taxonomic relations between concepts is a crucial step in constructing domain ontologies.

To obtain taxonomic relations from a text corpus, especially when the data is deficient, the approach of using the web as a source of collective knowledge (a.k.a web-based approach) is usually applied.

The important challenge of this approach is how to collect relevant knowledge from a large amount of web pages.

To overcome this issue, we propose a framework that combines Word Sense Disambiguation (WSD) and web approach to extract taxonomic relations from a domain-text corpus.

This framework consists of two main stages: concept extraction and taxonomic-relation extraction.

Concepts acquired from the concept-extraction stage are disambiguated through WSD module and passed to stage of extraction taxonomic relations afterward.

To evaluate the efficiency of the proposed framework, we conduct experiments on datasets about two domains of tourism and sport.

The obtained results show that the proposed method is efficient in corpora which are insufficient or have no training data.

Besides, the proposed method outperforms the state of the art method in corpora having high WSD results.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Doan, Phuoc Thi Hong& Arch Int, Ngamnij& Arch Int, Somjit. 2020. A semantic framework for extracting taxonomic relations from text corpus. The International Arab Journal of Information Technology،Vol. 17, no. 3, pp.325-337.
https://search.emarefa.net/detail/BIM-962341

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Doan, Phuoc Thi Hong…[et al.]. A semantic framework for extracting taxonomic relations from text corpus. The International Arab Journal of Information Technology Vol. 17, no. 3 (May. 2020), pp.325-337.
https://search.emarefa.net/detail/BIM-962341

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Doan, Phuoc Thi Hong& Arch Int, Ngamnij& Arch Int, Somjit. A semantic framework for extracting taxonomic relations from text corpus. The International Arab Journal of Information Technology. 2020. Vol. 17, no. 3, pp.325-337.
https://search.emarefa.net/detail/BIM-962341

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 335-336

رقم السجل

BIM-962341