A new approach for Arabic named entity recognition

Joint Authors

Karaa, Wahiba
Sulaymani, Thabit

Source

The International Arab Journal of Information Technology

Issue

Vol. 14, Issue 3 (31 May 2017), pp. 332-338, 7 p.

Publisher

Zarqa University

Publication Date

2017-05-31

Country of Publication

Jordan

No. of Pages

7

Main Subjects

Information Technology and Computer Science

Abstract EN

Named Entity Recognition (NER) plays a noteworthy role in Natural Language Processing (NLP) research, since it enables the detection of proper nouns in unstructured texts.

NER makes searching, retrieving, and extracting information easier, since the significant information in texts is usually located around proper names.

This paper suggests an efficient approach that can identify Named Entities (NE) in Arabic texts without the need for morphological or syntactic analysis or gazetteers.

The goal of our approach is to provide a general framework for Arabic NE recognition.

Within this framework, the system learns to recognize NE automatically and induces NE systematically, starting from sample NE instances as seeds.

This method takes advantage of the web: the approach learns from a web corpus.

The seeds are used to identify contexts on the web that denote NE, and those contexts are then used to identify new NE.
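
The seed-to-context-to-entity loop described above can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say how contexts are defined, scored, or filtered, so the sketch assumes that a context is simply the single word preceding an entity and that a context is trusted once it co-occurs with at least `min_count` known entities. The function names, the `min_count` and `rounds` parameters, and the transliterated toy corpus are all hypothetical.

```python
from collections import Counter

def learn_contexts(corpus, entities, min_count=2):
    """Collect words that immediately precede a known entity.

    Assumption: a "context" is just the preceding token; a real system
    would likely use wider windows and a reliability score.
    """
    counts = Counter()
    for sentence in corpus:
        tokens = sentence.split()
        for i in range(1, len(tokens)):
            if tokens[i] in entities:
                counts[tokens[i - 1]] += 1
    return {ctx for ctx, n in counts.items() if n >= min_count}

def extract_entities(corpus, contexts):
    """Return every token that directly follows a trusted context word."""
    found = set()
    for sentence in corpus:
        tokens = sentence.split()
        for i in range(len(tokens) - 1):
            if tokens[i] in contexts:
                found.add(tokens[i + 1])
    return found

def bootstrap(corpus, seeds, rounds=2, min_count=2):
    """Alternate between learning contexts and extracting new entities."""
    entities = set(seeds)
    for _ in range(rounds):
        contexts = learn_contexts(corpus, entities, min_count)
        new = extract_entities(corpus, contexts) - entities
        if not new:  # nothing new was found, so stop early
            break
        entities |= new
    return entities

# Hypothetical toy corpus standing in for web text (transliterated).
toy_corpus = [
    "visited city Tunis yesterday",
    "lives in city Cairo now",
    "the city Amman is large",
    "president Ali spoke",
]

result = bootstrap(toy_corpus, {"Tunis", "Cairo"})
print(sorted(result))  # ['Amman', 'Cairo', 'Tunis']
```

In this toy run the two seeds make "city" a trusted context, which in turn surfaces "Amman"; "Ali" is not found because its context "president" never co-occurs with a known entity.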

In a thorough experimental evaluation of our approach, the performance in recognizing NE, measured by recall, precision, and F-measure, is promising.

We obtained an overall F-measure of 83%.
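
For readers unfamiliar with the three measures named above, the following sketch shows how they are commonly computed from a set of predicted entities and a set of gold-standard entities. It assumes the balanced F-measure (F1, the harmonic mean of precision and recall); the record does not state which weighting the authors used. The function name and the example sets are hypothetical and are not results from the paper.

```python
def precision_recall_f1(predicted, gold):
    """Compute precision, recall, and balanced F-measure (F1) over sets.

    Assumption: F1 is used; the source only says "F-measure".
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# Hypothetical example: 3 of 4 predictions are correct, 3 of 4 gold items found.
p, r, f = precision_recall_f1(
    {"Tunis", "Cairo", "Amman", "city"},
    {"Tunis", "Cairo", "Amman", "Ali"},
)
print(p, r, f)  # 0.75 0.75 0.75
```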

American Psychological Association (APA)

Karaa, Wahiba & Sulaymani, Thabit. 2017. A new approach for Arabic named entity recognition. The International Arab Journal of Information Technology, Vol. 14, no. 3, pp. 332-338.
https://search.emarefa.net/detail/BIM-819524

Modern Language Association (MLA)

Karaa, Wahiba & Sulaymani, Thabit. A new approach for Arabic named entity recognition. The International Arab Journal of Information Technology Vol. 14, no. 3 (2017), pp. 332-338.
https://search.emarefa.net/detail/BIM-819524

American Medical Association (AMA)

Karaa, Wahiba & Sulaymani, Thabit. A new approach for Arabic named entity recognition. The International Arab Journal of Information Technology. 2017. Vol. 14, no. 3, pp. 332-338.
https://search.emarefa.net/detail/BIM-819524

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references: p. 336-337

Record ID

BIM-819524