Towards a distributed Arabic OCR based on the DTW algorithm : performance analysis

Joint Authors

Belghith, Abd al-Fattah
Khemakhem, Mahir

Source

The International Arab Journal of Information Technology

Issue

Vol. 6, Issue 2 (30 Apr. 2009), pp.153-161, 9 p.

Publisher

Zarqa University

Publication Date

2009-04-30

Country of Publication

Jordan

No. of Pages

9

Main Subjects

Information Technology and Computer Science

Abstract EN

In spite of the diversity of printed Arabic optical character recognition products and proposals, the problem seems to be not yet well solved.

The complex morphology and calligraphy of the Arabic writing on one hand and the use of some light approaches on the other hand are behind the poorness of these products.

However, some strong proposed approaches didn’t find the opportunity to be commercialized because of generally their corresponding complex computing.

The dynamic time warping algorithm is considered as one among these strong approaches.

In fact, several studies and experiments have shown and confirmed that the printed Arabic optical character recognition based on dynamic time warping algorithm provides a very interesting recognition rate especially for large and huge vocabularies.

One of the attractive sides of the dynamic time warping algorithm is its ability to recognize properly connected or cursive characters (words or sub words) without prior segmentation.

Furthermore, this algorithm performs the recognition process from within a reference library of isolated characters and owns a very good immunity against noises.

Unfortunately, the big amount of its computing during the recognition process makes its execution time very slow and, hence, restricts its utilization.

Many researchers attempted to speed up the execution time of this algorithm.

Unfortunately, the corresponding proposed solutions require generally specific high cost architectures.

Loosely coupled architectures such as grapes or grid computing can provide enough power without additional cost to distribute the complexity of some greedy applications.

Consequently, we report in this paper the performance analysis of an analytical and an experimental study of a distributed Arabic optical character recognition based on the dynamic time warping algorithm within loosely coupled architectures.

Obtained results confirm that loosely coupled architectures and more specifically grid computing present a very interesting framework to speed up the Arabic optical character recognition based on the dynamic time warping algorithm.

American Psychological Association (APA)

Khemakhem, Mahir& Belghith, Abd al-Fattah. 2009. Towards a distributed Arabic OCR based on the DTW algorithm : performance analysis. The International Arab Journal of Information Technology،Vol. 6, no. 2, pp.153-161.
https://search.emarefa.net/detail/BIM-10526

Modern Language Association (MLA)

Khemakhem, Mahir& Belghith, Abd al-Fattah. Towards a distributed Arabic OCR based on the DTW algorithm : performance analysis. The International Arab Journal of Information Technology Vol. 6, no. 2 (Apr. 2009), pp.153-161.
https://search.emarefa.net/detail/BIM-10526

American Medical Association (AMA)

Khemakhem, Mahir& Belghith, Abd al-Fattah. Towards a distributed Arabic OCR based on the DTW algorithm : performance analysis. The International Arab Journal of Information Technology. 2009. Vol. 6, no. 2, pp.153-161.
https://search.emarefa.net/detail/BIM-10526

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references: p. 160-p161

Record ID

BIM-10526