Using NLP for semi automated class diagram generation from Arabic text

Other Title(s)

استخدام أدوات معالجة اللغة الطبيعية للأتمتة الشبه آلية لتصميم مخطط الفئة من نص مكتوب باللغة العربية

Dissertant

al-Khatib, Lubnah Ghazi

Thesis advisor

Tartir, Samir

University

Philadelphia University

Faculty

Faculty of Information Technology

Department

Department of Computer Science

University Country

Jordan

Degree

Master

Degree Date

2017

English Abstract

Arabic language is one of the most used languages in the world.

Software requirements of an information system are written in natural language; Arabic or English.

Arabic is the formal language that is used to write software requirements by the most of ministries and governmental institutions in Arab countries.

In software engineering, manual analysis and design of the requirements cause variations of the obtained model due to the complexity of natural language.

Using natural language processing tools in analysing and transforming requirements into models enhances knowledge extraction performance.

Many approaches addressed this domain, but very little research has been developed to obtain models from requirements written in Arabic, especially class model.

Also, they never use semantic approaches that utilize the domain ontologies, which produce a gap between knowledge engineering and software engineering.

In this thesis, we propose an approach that solve the previous problems and automate the process of constructing a class model from Arabic requirements.

The core aim of our approach is to handle Arabic user requirements transformation into class diagrams.

We propose a set of identification rules to identify class diagram elements, then we combine the proposed set of identification rules, ANLP tools and domain ontologies to build a unique approach for generating UML class diagram from Arabic user requirements.

Our approach is the first approach that utilizes domain ontology and Arabic natural language processing tools to generate class diagrams form the Arabic requirements.

Finally, we validate our approach by performing an experiment, which involves a group of human evaluators.

The results showed that our approach produces very good results as compared to the evaluators depending on the performance evaluation criteria; precision and recall.

The results indicated that our proposed technique is a good candidate for extracting class diagrams from Arabic user requirements text as compared to existing approaches.

Main Subjects

Information Technology and Computer Science

No. of Pages

51

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Related work.

Chapter Three : Research methodology.

Chapter Four : Implementation, results and evaluation.

Chapter Five : Conclusion and Future Work

References.

American Psychological Association (APA)

al-Khatib, Lubnah Ghazi. (2017). Using NLP for semi automated class diagram generation from Arabic text. (Master's theses Theses and Dissertations Master). Philadelphia University, Jordan
https://search.emarefa.net/detail/BIM-955502

Modern Language Association (MLA)

al-Khatib, Lubnah Ghazi. Using NLP for semi automated class diagram generation from Arabic text. (Master's theses Theses and Dissertations Master). Philadelphia University. (2017).
https://search.emarefa.net/detail/BIM-955502

American Medical Association (AMA)

al-Khatib, Lubnah Ghazi. (2017). Using NLP for semi automated class diagram generation from Arabic text. (Master's theses Theses and Dissertations Master). Philadelphia University, Jordan
https://search.emarefa.net/detail/BIM-955502

Language

English

Data Type

Arab Theses

Record ID

BIM-955502