Comprehensive processing for Arabic texts to extract their roots

Other titles

معالجة شاملة للنصوص العربية لاستخلاص الجذور الصحيحة (Comprehensive processing of Arabic texts to extract the correct roots)

Author

Abu Salih, Bilal

Source

Iraqi Journal of Science

Issue

Vol. 60, no. 6 (30 June 2019), pp. 1404-1411, 8 p.

Publisher

University of Baghdad, College of Science

Publication date

2019-06-30

Country of publication

Iraq

Number of pages

8

Main subjects

Information Technology and Computer Science
Arabic Language and Literature

Topics

Abstract (EN)

Arabic is a highly inflectional language in which a single root can yield many word forms with different interpretations.

Arabic has no standard method for finding roots, because its inflection relies on suffixes, prefixes, and infixed vowels that are combined through complex processes.

For this reason, words need careful processing in information retrieval solutions, and so far there has been no standard approach to obtaining the fully correct root.

Studies of Arabic words show that around 99% are derived from biliteral, triliteral, and quadriliteral roots.

Stemming a word to extract its root is the process of removing all additional affixes.

Where a word can be matched against proper names, the affixes are stripped according to patterns and rules, with reference to root dictionaries.

This research presents a new series of steps that uses a new way of scanning affixes, vowels, and patterns across three stages of stemming.

If a match is not found, vowels are replaced and patterns are readjusted and checked again; if there is still no match, the word is kept unmodified.
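The staged lookup described above can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not list the affixes, patterns, or root dictionary used, so `PREFIXES`, `SUFFIXES`, `ROOTS`, and every function name below are hypothetical, and the "pattern" stage is reduced to dropping one inner weak letter.

```python
from itertools import product

# Hypothetical, tiny resources for illustration only; the paper's actual
# affix lists, patterns, and root dictionary are not given in the abstract.
PREFIXES = ["ال", "و", "ب", "م"]
SUFFIXES = ["ون", "ات", "ة", "ها"]
ROOTS = {"كتب", "درس", "قول", "علم"}
WEAK_LETTERS = "اوي"  # alif, waw, ya


def candidates(word):
    """Yield the word with each combination of one optional prefix and suffix removed."""
    for pre, suf in product([""] + PREFIXES, [""] + SUFFIXES):
        if word.startswith(pre) and word.endswith(suf):
            stem = word[len(pre): len(word) - len(suf)]
            if len(stem) >= 2:  # skip stems too short to be a root
                yield stem


def drop_infix(stem):
    """Yield the stem with one inner weak letter removed (a crude stand-in for pattern matching)."""
    for i in range(1, len(stem) - 1):
        if stem[i] in WEAK_LETTERS:
            yield stem[:i] + stem[i + 1:]


def swap_weak(stem):
    """Yield the stem with one weak letter replaced by each other weak letter."""
    for i, ch in enumerate(stem):
        if ch in WEAK_LETTERS:
            for alt in WEAK_LETTERS:
                if alt != ch:
                    yield stem[:i] + alt + stem[i + 1:]


def extract_root(word):
    """Try three stages in order; return the word unmodified if none finds a dictionary root."""
    stems = list(candidates(word))
    # Stage 1: affix removal, checked against the root dictionary.
    for stem in stems:
        if stem in ROOTS:
            return stem
    # Stage 2: remove an infixed weak letter, then check again.
    for stem in stems:
        for reduced in drop_infix(stem):
            if reduced in ROOTS:
                return reduced
    # Stage 3: replace a weak letter, then check again.
    for stem in stems:
        for swapped in swap_weak(stem):
            if swapped in ROOTS:
                return swapped
    # No match at any stage: keep the word as it is.
    return word


print(extract_root("مدرسة"))    # stage 1 → درس
print(extract_root("الكاتب"))   # stage 2 → كتب
print(extract_root("قال"))      # stage 3 → قول
print(extract_root("تلفزيون"))  # no match → تلفزيون
```

Checking every candidate against a root dictionary at each stage is what keeps unmatched words, such as loanwords, from being truncated.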

Search engines, indexing, file classification, clustering, and similar applications need improved root extraction, and the researcher introduces recommendations and solutions that contribute to improving Arabic root extraction.

The research applies comprehensive processing to a general collection of documents, carried out gradually, to improve root extraction by 96%.

American Psychological Association (APA) citation style

Abu Salih, Bilal. 2019. Comprehensive processing for Arabic texts to extract their roots. Iraqi Journal of Science, Vol. 60, no. 6, pp. 1404-1411.
https://search.emarefa.net/detail/BIM-969413

Modern Language Association (MLA) citation style

Abu Salih, Bilal. Comprehensive processing for Arabic texts to extract their roots. Iraqi Journal of Science Vol. 60, no. 6 (2019), pp. 1404-1411.
https://search.emarefa.net/detail/BIM-969413

American Medical Association (AMA) citation style

Abu Salih, Bilal. Comprehensive processing for Arabic texts to extract their roots. Iraqi Journal of Science. 2019. Vol. 60, no. 6, pp. 1404-1411.
https://search.emarefa.net/detail/BIM-969413

Record type

Articles

Text language

English

Notes

Includes bibliographical references: p. 1411

Record number

BIM-969413