Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic
Joint Authors
AbuZeina, Dia
Abdalbaset, Taqieddin Mostafa
Source
Issue
Vol. 2019, Issue 2019 (31 Dec. 2019), pp.1-10, 10 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2019-01-23
Country of Publication
Egypt
No. of Pages
10
Main Subjects
Abstract EN
The part of speech (PoS) tagging is a core component in many natural language processing (NLP) applications.
In fact, the PoS taggers contribute as a preprocessing step in various NLP tasks, such as syntactic parsing, information extraction, machine translation, and speech synthesis.
In this paper, we examine the performance of a modern standard Arabic (MSA) based tagger for the classical (i.e., traditional or historical) Arabic.
In this work, we employed the Stanford Arabic model tagger to evaluate the imperative verbs in the Holy Quran.
In fact, the Stanford tagger contains 29 tags; however, this work experimentally evaluates just one that is the VB ≡ imperative verb.
The testing set contains 741 imperative verbs, which appear in 1,848 positions in the Holy Quran.
Despite the previously reported accuracy of the Arabic model of the Stanford tagger, which is 96.26% for all tags and 80.14% for unknown words, the experimental results show that this accuracy is only 7.28% for the imperative verbs.
This result promotes the need for further research to expose why the tagging is severely inaccurate for classical Arabic.
The performance decline might be an indication of the necessity to distinguish between training data for both classical and MSA Arabic for NLP tasks.
American Psychological Association (APA)
AbuZeina, Dia& Abdalbaset, Taqieddin Mostafa. 2019. Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic. Advances in Fuzzy Systems،Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1118062
Modern Language Association (MLA)
AbuZeina, Dia& Abdalbaset, Taqieddin Mostafa. Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic. Advances in Fuzzy Systems No. 2019 (2019), pp.1-10.
https://search.emarefa.net/detail/BIM-1118062
American Medical Association (AMA)
AbuZeina, Dia& Abdalbaset, Taqieddin Mostafa. Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic. Advances in Fuzzy Systems. 2019. Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1118062
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1118062