Proposed Framework for the Evaluation of Standalone Corpora Processing Systems: An Application to Arabic Corpora
Joint Authors
Al-Thubaity, Abdulmohsen
Al-Khalifa, Hend
Alqifari, Reem
Almazrua, Manal
Source
The Scientific World Journal
Issue
Vol. 2014, Issue 2014 (31 Dec. 2014), pp. 1-10, 10 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2015-01-08
Country of Publication
Egypt
No. of Pages
10
Main Subjects
Medicine
Information Technology and Computer Science
Abstract EN
Despite the accessibility of numerous online corpora, students and researchers engaged in the fields of Natural Language Processing (NLP), corpus linguistics, and language learning and teaching may encounter situations in which they need to develop their own corpora.
Several commercial and free standalone corpora processing systems are available to process such corpora.
In this study, we first propose a framework for the evaluation of standalone corpora processing systems and then use it to evaluate seven freely available systems.
The proposed framework considers the usability, functionality, and performance of the evaluated systems while taking into consideration their suitability for Arabic corpora.
The results show that most of the evaluated systems had comparable usability scores, whereas their functionality and performance scores differed substantially with respect to support for the Arabic language and N-gram profile generation.
The results of our evaluation will help potential users of the evaluated systems to choose the system that best meets their needs.
More importantly, the results will help the developers of the evaluated systems improve them, and will give developers of new corpora processing systems a reference framework.
American Psychological Association (APA)
Al-Thubaity, Abdulmohsen & Al-Khalifa, Hend & Alqifari, Reem & Almazrua, Manal. 2015. Proposed Framework for the Evaluation of Standalone Corpora Processing Systems: An Application to Arabic Corpora. The Scientific World Journal, Vol. 2014, no. 2014, pp. 1-10.
https://search.emarefa.net/detail/BIM-1050288
Modern Language Association (MLA)
Al-Thubaity, Abdulmohsen, et al. Proposed Framework for the Evaluation of Standalone Corpora Processing Systems: An Application to Arabic Corpora. The Scientific World Journal No. 2014 (2014), pp. 1-10.
https://search.emarefa.net/detail/BIM-1050288
American Medical Association (AMA)
Al-Thubaity, Abdulmohsen & Al-Khalifa, Hend & Alqifari, Reem & Almazrua, Manal. Proposed Framework for the Evaluation of Standalone Corpora Processing Systems: An Application to Arabic Corpora. The Scientific World Journal. 2015. Vol. 2014, no. 2014, pp. 1-10.
https://search.emarefa.net/detail/BIM-1050288
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1050288