دراسة منهجيات التشكيل الآلي للنصوص العربية بهدف وضع خطة عمل لبناء مشكل آلي مفتوح المصدر

Other Title(s)

Exploring Arabic text diacritization approaches in view of establishing an action plan for developing an open source diacritizer

Times Cited in Arcif

1

Joint Authors

غنيم، ندى
ربداوي، غيداء

Source

مجلة جامعة دمشق للعلوم الهندسية

Issue

Vol. 29, Issue 2 (31 Dec. 2013), pp. 277-295, 19 p.

Publisher

Damascus University

Publication Date

2013-12-31

Country of Publication

Syria

No. of Pages

19

Main Subjects

Information Technology and Computer Science

Abstract EN

The absence of diacritics in Arabic texts is one of the most important challenges facing automatic Arabic language processing.

When reading, an Arabic reader can infer the correct diacritics of words, while computers need algorithms that restore the diacritization using knowledge at different levels.

Diacritization here covers all the diacritics (damma, fatha, kasra, sukun), in addition to shadda and tanween.

Some diacritization methods are based on the linguistic processing of texts, while others are statistical and rely on text corpora.

Some systems integrate the two methodologies in hybrid approaches.

In this paper, we present a comprehensive study of the methods adopted in these diacritization systems.

In addition, we review the various corpora that have been used for testing and evaluation, then suggest specifications for the Arabic corpus that diacritization systems need and the standards that the evaluation process must take into account.

The main objective is to develop an action plan for the construction of an automatic diacritizer of Arabic texts under the auspices of ALECSO, with the participation of many research entities from different countries.

American Psychological Association (APA)

غنيم، ندى وربداوي، غيداء. 2013. دراسة منهجيات التشكيل الآلي للنصوص العربية بهدف وضع خطة عمل لبناء مشكل آلي مفتوح المصدر. مجلة جامعة دمشق للعلوم الهندسية، مج. 29، ع. 2، ص ص. 277-295.
https://search.emarefa.net/detail/BIM-751044

Modern Language Association (MLA)

غنيم، ندى وربداوي، غيداء. دراسة منهجيات التشكيل الآلي للنصوص العربية بهدف وضع خطة عمل لبناء مشكل آلي مفتوح المصدر. مجلة جامعة دمشق للعلوم الهندسية مج. 29، ع. 2 (2013)، ص ص. 277-295.
https://search.emarefa.net/detail/BIM-751044

American Medical Association (AMA)

غنيم، ندى وربداوي، غيداء. دراسة منهجيات التشكيل الآلي للنصوص العربية بهدف وضع خطة عمل لبناء مشكل آلي مفتوح المصدر. مجلة جامعة دمشق للعلوم الهندسية. 2013. مج. 29، ع. 2، ص ص. 277-295.
https://search.emarefa.net/detail/BIM-751044

Data Type

Journal Articles

Language

Arabic

Notes

Includes footnotes.

Record ID

BIM-751044