Automatic restoration of Arabic diacritics : a simple, purely statistical approach

Joint Authors

al-Ghamidi, Mansur
Muzaffar, Zishan
al-Hukmi, Hazim

Source

The Arabian Journal for Science and Engineering. Section C, Theme issues

Issue

Vol. 35, Issue 2C(s) (31 Dec. 2010), pp. 126-135, 10 p.

Publisher

King Fahd University of Petroleum and Minerals

Publication Date

2010-12-31

Country of Publication

Saudi Arabia

No. of Pages

10

Main Subjects

Languages & Comparative Literature
Information Technology and Computer Science

Abstract AR

تعرض هذه الورقة طريقة مبتكرة لتشكيل النص العربي آليا.

حيث تعتمد معظم الطرق السابقة المنشورة على أدوات أخرى كأدوات أنموذج ماركوف الخفي و / أو المعرفة اللغوية كالصرف و النحو.

و النظام الذي بين أيدينا هو نظام أساسي يتميز بصغر حجمه و سرعته في المعالجة بمعزل عن الأدوات الحاسوبية المعروفة أو القوانين اللغوية.

إذ يستخدم هذا النظام طريقة إحصائية بحتة تعتمد على احتمالية التسلسل الرباعي للحروف.

و ينتج عنه درجة عالية في دقة التشكيل إذا ما قورن بالأنظمة السابقة التي تعتمد على الإحصاء فقط بوصفه نظاما أساسيا.

Abstract EN

This paper presents an innovative Arabic diacritizer.

Most previously published work on diacritization depends on other tools, such as the Hidden Markov Model Toolkit, and/or on higher-level linguistic knowledge such as morphology and syntax.

The present system is a baseline system that is small in size, fast in processing, and independent of other tools and linguistic rules.

The system uses a statistical method that relies on quad-gram probabilities.

Its accuracy rate is relatively high when compared to previous systems that are based only on statistics.
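As a rough illustration of what a quad-gram approach to diacritization can look like, the sketch below groups each letter with its diacritics into one unit, counts how each bare letter is diacritized after up to three preceding units, and then restores diacritics greedily, backing off to shorter contexts when a longer one was never seen. This record does not describe the paper's actual algorithm, so everything here is an assumption made for illustration: the function names (`split_units`, `train`, `restore`), the greedy left-to-right decoding, the backoff scheme, and the choice of U+064B to U+0652 as the diacritic set.

```python
from collections import Counter, defaultdict

# Assumption: treat the Arabic marks U+064B (fathatan) .. U+0652 (sukun) as diacritics.
DIACRITICS = {chr(c) for c in range(0x064B, 0x0653)}


def split_units(text):
    """Group each base character with the diacritics that follow it."""
    units = []
    for ch in text:
        if ch in DIACRITICS and units:
            units[-1] += ch
        else:
            units.append(ch)
    return units


def train(diacritized_text, n=4):
    """Count diacritized forms of each bare letter per context of 0..n-1 preceding units."""
    counts = defaultdict(Counter)
    units = split_units(diacritized_text)
    for i, unit in enumerate(units):
        bare = unit[0]
        for k in range(n):
            if i - k < 0:
                break
            context = tuple(units[i - k:i])
            counts[(context, bare)][unit] += 1
    return counts


def restore(bare_text, counts, n=4):
    """Greedy restoration: pick the most frequent form given the longest seen context."""
    out = []
    for ch in bare_text:
        choice = ch  # fall back to the undiacritized character
        for k in range(n - 1, -1, -1):
            if k > len(out):
                continue
            context = tuple(out[len(out) - k:])
            seen = counts.get((context, ch))
            if seen:
                choice = seen.most_common(1)[0][0]
                break
        out.append(choice)
    return "".join(out)


# Toy usage: train on one fully diacritized word, then restore its bare form.
model = train("\u0643\u064E\u062A\u064E\u0628\u064E")
restored = restore("\u0643\u062A\u0628", model)
```

Picking the most frequent form for a given context is equivalent to maximizing a relative-frequency estimate of the conditional probability. A real system would need a large diacritized corpus and probably smoothing, neither of which is shown here.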

American Psychological Association (APA)

al-Ghamidi, Mansur & Muzaffar, Zishan & al-Hukmi, Hazim. 2010. Automatic restoration of Arabic diacritics : a simple, purely statistical approach. The Arabian Journal for Science and Engineering. Section C, Theme issues, Vol. 35, no. 2C(s), pp. 126-135.
https://search.emarefa.net/detail/BIM-308412

Modern Language Association (MLA)

al-Ghamidi, Mansur…[et al.]. Automatic restoration of Arabic diacritics : a simple, purely statistical approach. The Arabian Journal for Science and Engineering. Section C, Theme issues Vol. 35, no. 2C(s) (Dec. 2010), pp. 126-135.
https://search.emarefa.net/detail/BIM-308412

American Medical Association (AMA)

al-Ghamidi, Mansur & Muzaffar, Zishan & al-Hukmi, Hazim. Automatic restoration of Arabic diacritics : a simple, purely statistical approach. The Arabian Journal for Science and Engineering. Section C, Theme issues. 2010. Vol. 35, no. 2C(s), pp. 126-135.
https://search.emarefa.net/detail/BIM-308412

Data Type

Journal Articles

Language

English

Notes

Includes appendix : p. 134-135

Record ID

BIM-308412