Text Extraction from Historical Document Images by the Combination of Several Thresholding Techniques

Joint Authors

Sari, Toufik
Kefali, Abderrahmane
Bahi, Halima

Source

Advances in Multimedia

Issue

Vol. 2014, Issue 2014 (31 Dec. 2014), pp. 1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2014-09-28

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Information Technology and Computer Science

Abstract (EN)

This paper presents a new technique for the binarization of historical document images, whose deterioration and damage make automatic processing difficult at several levels.

The proposed method is based on hybrid thresholding, which combines the advantages of global and local methods, and on a mixture of several binarization techniques.

The method proceeds in two stages.

In the first stage, global thresholding is applied to the entire image and two different thresholds are determined, from which most of the image pixels are classified as foreground or background.
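
The first stage can be illustrated with a minimal sketch. The abstract does not say how the two thresholds are derived, so everything specific here is an assumption: the image is an 8-bit grayscale list of rows with dark text on a light background, the thresholds are placed a fixed `margin` below and above an Otsu threshold, and the names `otsu_threshold`, `global_stage`, and `UNDECIDED` are hypothetical.

```python
FOREGROUND, BACKGROUND, UNDECIDED = 0, 255, -1

def otsu_threshold(image):
    """Gray level (0-255) maximizing the between-class variance (Otsu)."""
    hist = [0] * 256
    for row in image:
        for p in row:
            hist[p] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_dark, sum_dark = 0, 0
    for t in range(256):
        w_dark += hist[t]            # pixels with gray level <= t
        sum_dark += t * hist[t]
        w_bright = total - w_dark    # pixels with gray level > t
        if w_dark == 0 or w_bright == 0:
            continue
        diff = sum_dark / w_dark - (sum_all - sum_dark) / w_bright
        between = w_dark * w_bright * diff * diff
        if between > best_var:
            best_t, best_var = t, between
    return best_t

def global_stage(image, t_low=None, t_high=None, margin=30):
    """Label clearly dark pixels as foreground, clearly bright pixels as
    background, and leave everything in between undecided."""
    if t_low is None or t_high is None:
        # Assumption: thresholds sit a fixed margin around the Otsu level.
        t = otsu_threshold(image)
        t_low, t_high = t - margin, t + margin
    return [[FOREGROUND if p <= t_low
             else BACKGROUND if p >= t_high
             else UNDECIDED
             for p in row] for row in image]
```

Calling `global_stage(image)` returns a label map of the same shape as `image`; only the pixels marked `UNDECIDED` would need the more expensive local analysis.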

In the second stage, the remaining pixels are assigned to foreground or background classes based on local analysis.

In this stage, several local thresholding methods are combined and the final binary value of each remaining pixel is chosen as the most probable one.
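
A rough sketch of the second stage follows. The abstract names neither the local methods nor the rule for picking "the most probable" value, so this example assumes three stand-ins (Niblack, Sauvola, and a Bernsen-like mid-range threshold without the contrast test) combined by majority vote. The function names, the window radius, and the parameters `k` and `R` are illustrative choices, not the authors' settings.

```python
import math

FOREGROUND, BACKGROUND, UNDECIDED = 0, 255, -1

def window_values(image, r, c, radius):
    """Gray levels in the square window centred on (r, c), clipped at borders."""
    rows, cols = len(image), len(image[0])
    return [image[i][j]
            for i in range(max(0, r - radius), min(rows, r + radius + 1))
            for j in range(max(0, c - radius), min(cols, c + radius + 1))]

def local_thresholds(values):
    """Three local thresholds computed from one window of gray levels."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    niblack = mean - 0.2 * std                      # k = -0.2
    sauvola = mean * (1 + 0.5 * (std / 128 - 1))    # k = 0.5, R = 128
    midrange = (min(values) + max(values)) / 2      # Bernsen-like
    return [niblack, sauvola, midrange]

def local_stage(image, labels, radius=7):
    """Resolve UNDECIDED pixels by majority vote of the local methods.
    Pixels already labelled by the global stage are left unchanged."""
    result = [row[:] for row in labels]
    for r, row in enumerate(labels):
        for c, label in enumerate(row):
            if label != UNDECIDED:
                continue
            thresholds = local_thresholds(window_values(image, r, c, radius))
            votes = sum(image[r][c] <= t for t in thresholds)
            result[r][c] = (FOREGROUND if 2 * votes > len(thresholds)
                            else BACKGROUND)
    return result
```

Here `labels` is a map of `FOREGROUND`, `BACKGROUND`, and `UNDECIDED` values with the same shape as `image`, such as a first global stage might produce. An odd number of voters avoids ties; a weighted or probabilistic combination would be an equally plausible reading of the abstract.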

The proposed technique has been tested on a large collection of standard and synthetic documents. It was compared with well-known methods using standard measures and was shown to be more effective.

American Psychological Association (APA) citation style

Sari, Toufik & Kefali, Abderrahmane & Bahi, Halima. 2014. Text Extraction from Historical Document Images by the Combination of Several Thresholding Techniques. Advances in Multimedia, Vol. 2014, no. 2014, pp. 1-10.
https://search.emarefa.net/detail/BIM-1015492

Modern Language Association (MLA) citation style

Sari, Toufik…[et al.]. Text Extraction from Historical Document Images by the Combination of Several Thresholding Techniques. Advances in Multimedia No. 2014 (2014), pp. 1-10.
https://search.emarefa.net/detail/BIM-1015492

American Medical Association (AMA) citation style

Sari, Toufik & Kefali, Abderrahmane & Bahi, Halima. Text Extraction from Historical Document Images by the Combination of Several Thresholding Techniques. Advances in Multimedia. 2014. Vol. 2014, no. 2014, pp. 1-10.
https://search.emarefa.net/detail/BIM-1015492

Data Type

Articles

Text Language

English

Notes

Includes bibliographical references.

Record ID

BIM-1015492