Modern standard Arabic grammar automatic extraction from Penn Arabic Treebank using natural language toolkit

العناوين الأخرى

استخلاص قواعد النحو لتراكيب جمل اللغة العربية المعاصرة آليا باستخدام عينة لغوية من Penn Arabic Treebank باستخدام NLTK

المؤلفون المشاركون

Abd al-Halim, Amirah
al-Ansari, Samih

المصدر

The Egyptian Journal of Language Engineering

العدد

المجلد 5، العدد 1 (30 إبريل/نيسان 2018)، ص ص. 1-10، 10ص.

الناشر

الجمعية المصرية لهندسة اللغة

تاريخ النشر

2018-04-30

دولة النشر

مصر

عدد الصفحات

10

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

This paper presents a methodology for rule based bottom up parsing technique forModern Standard Arabic (MSA) in Context Free Grammar (CFG) formalism in Phrase Structure Grammar (PSG) representation, where the grammar is automatically extracted from a syntactically annotated corpus.The extracted grammar is used to build an automatic lexicon and grammar rules module.

Furthermore, the extracted CFG is further transformed into Probabilistic Context Free Grammar (PCFG) that could be used in a hybrid approach, which is also calculated automatically.

The used corpus is the Penn Arabic Treebank(PATB)and algorithm implementation is performed with Natural Language Processing Toolkit (NLTK).The parser showed that automatic extraction of grammar improved the grammar building phase in both coverage of structures and time needed, but still needs further manual constrains addition.

Automatic extraction of grammar is able to enhance rule based grammar parsers and it will enable a new paradigm of statistically directed symbolic parsing.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Abd al-Halim, Amirah& al-Ansari, Samih. 2018. Modern standard Arabic grammar automatic extraction from Penn Arabic Treebank using natural language toolkit. The Egyptian Journal of Language Engineering،Vol. 5, no. 1, pp.1-10.
https://search.emarefa.net/detail/BIM-941786

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Abd al-Halim, Amirah& al-Ansari, Samih. Modern standard Arabic grammar automatic extraction from Penn Arabic Treebank using natural language toolkit. The Egyptian Journal of Language Engineering Vol. 5, no. 1 (Apr. 2018), pp.1-10.
https://search.emarefa.net/detail/BIM-941786

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Abd al-Halim, Amirah& al-Ansari, Samih. Modern standard Arabic grammar automatic extraction from Penn Arabic Treebank using natural language toolkit. The Egyptian Journal of Language Engineering. 2018. Vol. 5, no. 1, pp.1-10.
https://search.emarefa.net/detail/BIM-941786

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

رقم السجل

BIM-941786