Automatic restoration of Arabic diacritics : a simple, purely statistical approach
Joint Authors
al-Ghamidi, Mansur
Muzaffar, Zishan
al-Hukmi, Hazim
Source
The Arabian Journal for Science and Engineering. Section C, Theme issues
Issue
Vol. 35, Issue 2C(s) (31 Dec. 2010), pp.126-135, 10 p.
Publisher
King Fahd University of Petroleum and Minerals
Publication Date
2010-12-31
Country of Publication
Saudi Arabia
No. of Pages
10
Main Subjects
Languages & Comparative Literature
Information Technology and Computer Science
Abstract AR (English translation)
This paper presents an innovative method for automatically diacritizing Arabic text.
Most previously published methods rely on other tools, such as Hidden Markov Model tools, and / or on linguistic knowledge such as morphology and syntax.
The system presented here is a baseline system characterized by its small size and fast processing, and it is independent of well-known computational tools and linguistic rules.
It uses a purely statistical method based on the probabilities of four-letter sequences (letter quad-grams).
It achieves a high degree of diacritization accuracy when compared with previous systems that rely on statistics alone as a baseline.
Abstract EN
This paper presents an innovative Arabic diacritizer.
Most previously published work on diacritization depends on other tools, such as the Hidden Markov Model Toolkit, and / or on higher-level linguistic knowledge such as morphology and syntax.
The present system is a baseline system that is small in size, fast in processing, and independent of other tools and linguistic rules.
The system uses a statistical method that relies on quad-gram probabilities.
Its accuracy rate is relatively high when compared to previous systems that are based only on statistics.
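As an illustration only, the following Python sketch shows one way a letter-level quad-gram lookup could restore diacritics. It is not the authors' implementation: the record does not describe their model in detail, so the left-context design, the backoff from four letters down to one, and the names `split_marks`, `train`, and `restore` are all assumptions made for this example.

```python
from collections import Counter, defaultdict

# Arabic diacritic marks (tanwin, short vowels, shadda, sukun): U+064B..U+0652.
DIACRITICS = {chr(c) for c in range(0x064B, 0x0653)}
MAX_N = 4  # quad-gram: the current letter plus up to three preceding letters


def split_marks(text):
    """Split diacritized text into (base_char, marks) pairs."""
    pairs = []
    for ch in text:
        if ch in DIACRITICS and pairs:
            base, marks = pairs[-1]
            pairs[-1] = (base, marks + ch)
        elif ch not in DIACRITICS:
            pairs.append((ch, ""))
    return pairs


def train(corpus):
    """Count, per letter context of length 1..MAX_N, the marks seen on its last letter."""
    counts = defaultdict(Counter)
    for line in corpus:
        pairs = split_marks(line)
        letters = "".join(base for base, _ in pairs)
        for i, (_, marks) in enumerate(pairs):
            for n in range(1, MAX_N + 1):
                if i + 1 >= n:
                    counts[letters[i + 1 - n : i + 1]][marks] += 1
    return counts


def restore(text, counts):
    """Give each character the most frequent marks of its longest seen context."""
    letters = "".join(base for base, _ in split_marks(text))
    out = []
    for i, ch in enumerate(letters):
        marks = ""
        # Assumed backoff: try the quad-gram first, then shorter contexts.
        for n in range(min(MAX_N, i + 1), 0, -1):
            context = letters[i + 1 - n : i + 1]
            if context in counts:
                marks = counts[context].most_common(1)[0][0]
                break
        out.append(ch + marks)
    return "".join(out)
```

Calling `train` on a list of diacritized lines and then `restore` on undiacritized text returns the text with the most frequent marks attached to each letter. Characters whose contexts were never seen in training are left unmarked.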
American Psychological Association (APA)
al-Ghamidi, Mansur & Muzaffar, Zishan & al-Hukmi, Hazim. 2010. Automatic restoration of Arabic diacritics : a simple, purely statistical approach. The Arabian Journal for Science and Engineering. Section C, Theme issues, Vol. 35, no. 2C(s), pp. 126-135.
https://search.emarefa.net/detail/BIM-308412
Modern Language Association (MLA)
al-Ghamidi, Mansur … [et al.]. Automatic restoration of Arabic diacritics : a simple, purely statistical approach. The Arabian Journal for Science and Engineering. Section C, Theme issues, Vol. 35, no. 2C(s) (Dec. 2010), pp. 126-135.
https://search.emarefa.net/detail/BIM-308412
American Medical Association (AMA)
al-Ghamidi, Mansur & Muzaffar, Zishan & al-Hukmi, Hazim. Automatic restoration of Arabic diacritics : a simple, purely statistical approach. The Arabian Journal for Science and Engineering. Section C, Theme issues. 2010. Vol. 35, no. 2C(s), pp. 126-135.
https://search.emarefa.net/detail/BIM-308412
Data Type
Journal Articles
Language
English
Notes
Includes appendix : p. 134-135
Record ID
BIM-308412