Constructing a lexicon of Arabic-English named entity using SMT and semantic linked data

Co-authors

Hkiri, Emna
Mallat, Suhayl
Zrigui, Munir

Source

The International Arab Journal of Information Technology

Issue

Vol. 14, no. 6 (30 November 2017), 6 p.

Publisher

Zarqa University

Publication date

2017-11-30

Country of publication

Jordan

Number of pages

6

Main subjects

Information Technology and Computer Science

Abstract EN

Named entity recognition is the problem of locating and categorizing atomic entities in a given text.

In this work, we used DBpedia linked datasets and combined existing open-source tools to generate a bilingual lexicon of Named Entities (NE) from a parallel corpus.

To annotate NEs in the monolingual English corpus, we used linked data entities by mapping them to GATE gazetteers.
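
As a rough illustration of the mapping step, the sketch below groups entity labels by type and renders them as gazetteer list contents plus an index in the `file.lst:majorType` style used by GATE gazetteer definition files. The sample entities, the type names, and the helper `build_gazetteer` are hypothetical; the abstract does not say how the mapping was actually performed.

```python
from collections import defaultdict


def build_gazetteer(entities):
    """Group (label, entity_type) pairs into gazetteer list contents.

    Returns (lists, index): `lists` maps a file name to its sorted entries,
    and `index` holds one 'file.lst:majorType' line per list file.
    """
    by_type = defaultdict(set)
    for label, entity_type in entities:
        label = label.strip()
        if label:  # skip empty labels
            by_type[entity_type.lower()].add(label)

    lists = {f"{t}.lst": sorted(labels) for t, labels in by_type.items()}
    index = [f"{t}.lst:{t}" for t in sorted(by_type)]
    return lists, index


# Hypothetical sample of linked-data entities as (label, type) pairs.
sample = [
    ("Amman", "Location"),
    ("Zarqa University", "Organization"),
    ("Tunis", "Location"),
    ("Amman", "Location"),  # duplicate, collapsed by the set
]
lists, index = build_gazetteer(sample)
```

Each key of `lists` would be written out as one plain-text file with one entry per line, and `index` as the accompanying definition file.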

To translate the entities that GATE identified in the English corpus, we used Moses, a statistical machine translation system.

The construction of the Arabic-English named entities lexicon is based on the results of the Moses translation.
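
A minimal sketch of this final assembly step, assuming the translation output is available as two line-aligned lists (English entities and their Arabic translations). The function `build_lexicon` and the sample pairs are hypothetical; the code only illustrates pairing and de-duplication and does not invoke Moses or reproduce the authors' pipeline.

```python
def build_lexicon(english_entities, arabic_translations):
    """Pair line-aligned entities with translations, dropping empty and duplicate pairs."""
    if len(english_entities) != len(arabic_translations):
        raise ValueError("inputs must be line-aligned")

    lexicon = []
    seen = set()
    for en, ar in zip(english_entities, arabic_translations):
        pair = (en.strip(), ar.strip())
        # Keep a pair only if both sides are non-empty and it is new.
        if all(pair) and pair not in seen:
            seen.add(pair)
            lexicon.append(pair)
    return lexicon


# Hypothetical aligned sample: one duplicate and one untranslated entity.
english = ["Jordan", "Tunisia", "Jordan", "Zarqa University"]
arabic = ["الأردن", "تونس", "الأردن", ""]
lexicon = build_lexicon(english, arabic)
```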

Our method is fully automatic and aims to help Natural Language Processing (NLP) tasks such as machine translation, information retrieval, text mining, and question answering.

Our lexicon contains 48753 pairs of Arabic-English NEs and is freely available for use by other researchers.

American Psychological Association (APA) citation style

Hkiri, Emna & Mallat, Suhayl & Zrigui, Munir. 2017. Constructing a lexicon of Arabic-English named entity using SMT and semantic linked data. The International Arab Journal of Information Technology, Vol. 14, no. 6.
https://search.emarefa.net/detail/BIM-853091

Modern Language Association (MLA) citation style

Hkiri, Emna…[et al.]. Constructing a lexicon of Arabic-English named entity using SMT and semantic linked data. The International Arab Journal of Information Technology Vol. 14, no. 6 (Nov. 2017).
https://search.emarefa.net/detail/BIM-853091

American Medical Association (AMA) citation style

Hkiri, Emna & Mallat, Suhayl & Zrigui, Munir. Constructing a lexicon of Arabic-English named entity using SMT and semantic linked data. The International Arab Journal of Information Technology. 2017. Vol. 14, no. 6.
https://search.emarefa.net/detail/BIM-853091

Data type

Articles

Text language

English

Notes

Includes bibliographical references

Record ID

BIM-853091