Arabic optical character segmentation stage

Other Title(s)

مرحلة التجزئة في أنظمة التعرف الضوئي على حروف اللغة العربية

Dissertant

Ayyash, Muna Abd al-Fattah

Thesis advisor

Muhammad, Khidr

Comitee Members

al-Sadih, Ahmad
Mafarjah, Majdi

University

Birzeit University

Faculty

Faculty of Engineering and Technology

Department

Department of Computer Science

University Country

Palestine (West Bank)

Degree

Master

Degree Date

2016

English Abstract

A considerable progress in recognition techniques for many non-Arabic characters has been achieved.

In contrary, few efforts have been put on the research of Arabic characters.

In any Optical Character Recognition (OCR) system the segmentation step is usually the essential stage in which an extensive portion of processing is devoted and a considerable share of recognition errors is attributed.

In this research, a novel segmentation approach for machine Arabic printed text for different segmentation stages; line segmentation, word and sub-word segmentation, and character segmentation including diacritics with different font types, styles, and sizes are proposed.

The proposed approach based on profile projection techniques, finding global maximum peak, connected components, region properties, and extracting the word's contour.

These methods reduce computation, errors, and give a clear description for the sub-word.

Both of evaluation and testing of the proposed methods have been developed using MATLAB and shows nearly 98% accuracy for different segmentation stages

Main Subjects

Information Technology and Computer Science

No. of Pages

114

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Background and literature review.

Chapter Three : Proposed work.

Chapter Four : Testing and results.

Chapter Five : Conclusion and future work.

References.

American Psychological Association (APA)

Ayyash, Muna Abd al-Fattah. (2016). Arabic optical character segmentation stage. (Master's theses Theses and Dissertations Master). Birzeit University, Palestine (West Bank)
https://search.emarefa.net/detail/BIM-762526

Modern Language Association (MLA)

Ayyash, Muna Abd al-Fattah. Arabic optical character segmentation stage. (Master's theses Theses and Dissertations Master). Birzeit University. (2016).
https://search.emarefa.net/detail/BIM-762526

American Medical Association (AMA)

Ayyash, Muna Abd al-Fattah. (2016). Arabic optical character segmentation stage. (Master's theses Theses and Dissertations Master). Birzeit University, Palestine (West Bank)
https://search.emarefa.net/detail/BIM-762526

Language

English

Data Type

Arab Theses

Record ID

BIM-762526