Using image pre-classification to improve the accuracy of the image captioning systems

العناوين الأخرى

استخدام التصنيف المسبق للصور لتحسين دقة أنظمة وصف الصور

المؤلفون المشاركون

al-Khayr, Jafar
Sulayman, Samir
Mualla, Rasha

المصدر

Damascus University Journal of Engineering Sciences

العدد

المجلد 37، العدد 2 (30 يونيو/حزيران 2021)، ص ص. 111-124، 14ص.

الناشر

جامعة دمشق

تاريخ النشر

2021-06-30

دولة النشر

سوريا

عدد الصفحات

14

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

Deep learning for the purpose of image description and captioning has been one of the most promised computer science application recently.

it consists of two parts; the image and the text description models.

in previous researches, we studied the effect of using different languages and datasets on the image description models.

in this paper, we study the classification effect of the image dataset on those models.

so, a new combined 12000-images dataset consisting of two international datasets (Flickr2k and MS-COCO) is built.

the designed models support Arabic and English languages.

for the description part, we used two different scenarios.

in the first scenario, we used CNN and LSTM models.

while for the second one, resnet50 and fasttext are used as image and text models respectively.

the training is applied for both indoor and outdoor classes.

tests scenarios are applied in two cases and four ways which are the word-by-word and the sentence-by-sentence models.

the performance analysis proves that classified classes have a higher performance than the unclassified ones in case of repeating-based and non-repeating-based datasets in all scenarios.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Mualla, Rasha& al-Khayr, Jafar& Sulayman, Samir. 2021. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences،Vol. 37, no. 2, pp.111-124.
https://search.emarefa.net/detail/BIM-1417230

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

al-Khayr, Jafar…[et al.]. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences Vol. 37, no. 2 (2021), pp.111-124.
https://search.emarefa.net/detail/BIM-1417230

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Mualla, Rasha& al-Khayr, Jafar& Sulayman, Samir. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences. 2021. Vol. 37, no. 2, pp.111-124.
https://search.emarefa.net/detail/BIM-1417230

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 123-124

رقم السجل

BIM-1417230