Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic

Publication Date

2019-01-23

Country of Publication

Egypt

No. of Pages

Main Subjects

Mathematics

Abstract EN

The part of speech (PoS) tagging is a core component in many natural language processing (NLP) applications.

In fact, the PoS taggers contribute as a preprocessing step in various NLP tasks, such as syntactic parsing, information extraction, machine translation, and speech synthesis.

In this paper, we examine the performance of a modern standard Arabic (MSA) based tagger for the classical (i.e., traditional or historical) Arabic.

In this work, we employed the Stanford Arabic model tagger to evaluate the imperative verbs in the Holy Quran.

In fact, the Stanford tagger contains 29 tags; however, this work experimentally evaluates just one that is the VB ≡ imperative verb.

The testing set contains 741 imperative verbs, which appear in 1,848 positions in the Holy Quran.

Despite the previously reported accuracy of the Arabic model of the Stanford tagger, which is 96.26% for all tags and 80.14% for unknown words, the experimental results show that this accuracy is only 7.28% for the imperative verbs.

This result promotes the need for further research to expose why the tagging is severely inaccurate for classical Arabic.

The performance decline might be an indication of the necessity to distinguish between training data for both classical and MSA Arabic for NLP tasks.

American Psychological Association (APA)

AbuZeina, Dia& Abdalbaset, Taqieddin Mostafa. 2019. Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic. Advances in Fuzzy Systems،Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1118062

Modern Language Association (MLA)

AbuZeina, Dia& Abdalbaset, Taqieddin Mostafa. Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic. Advances in Fuzzy Systems No. 2019 (2019), pp.1-10.
https://search.emarefa.net/detail/BIM-1118062

American Medical Association (AMA)

AbuZeina, Dia& Abdalbaset, Taqieddin Mostafa. Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic. Advances in Fuzzy Systems. 2019. Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1118062

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1118062

SaveSaved Print

Arab Citation & Impact Factor "Arcif"

Largest Arabic Database of Citations Analysis for the Arabic Scholarly Journals Issued in Arab World.

e-Marefa Platform for Arabic Textbook.

"Kashif" for Checking Similarity or Plagiarism in the Arabic Researches. know more