Using image pre-classification to improve the accuracy of the image captioning systems

Other Title(s)

استخدام التصنيف المسبق للصور لتحسين دقة أنظمة وصف الصور

Joint Authors

al-Khayr, Jafar
Sulayman, Samir
Mualla, Rasha

Source

Damascus University Journal of Engineering Sciences

Issue

Vol. 37, Issue 2 (30 Jun. 2021), pp.111-124, 14 p.

Publisher

Damascus University

Publication Date

2021-06-30

Country of Publication

Syria

No. of Pages

14

Main Subjects

Information Technology and Computer Science

Abstract EN

Deep learning for image description and captioning has recently been one of the most promising applications in computer science. A captioning system consists of two parts: the image model and the text description model. In previous research, we studied the effect of using different languages and datasets on image description models. In this paper, we study the effect of classifying the image dataset on those models. To this end, we built a new combined dataset of 12,000 images drawn from two international datasets (Flickr2k and MS-COCO). The designed models support Arabic and English. For the description part, we used two scenarios: in the first, we used CNN and LSTM models; in the second, ResNet50 and fastText are used as the image and text models, respectively. Training is applied to both the indoor and outdoor classes. Test scenarios are applied in two cases and four ways, which are the word-by-word and the sentence-by-sentence models. The performance analysis shows that the classified classes achieve higher performance than the unclassified ones for both repeating-based and non-repeating-based datasets in all scenarios.
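The pre-classification idea in the abstract can be pictured with a minimal sketch: an image is first assigned to a class (indoor or outdoor), then passed to a captioning model trained only on that class. The sketch below is an illustration under stated assumptions, not the authors' implementation. The names `classify_scene`, `make_captioner`, and `caption_with_preclassification`, the `sky_ratio` feature, and the 0.3 threshold are all hypothetical stand-ins for the real classifier and the CNN/LSTM or ResNet50/fastText models.

```python
from typing import Callable, Dict

# Hypothetical representation: an "image" is a dict of toy features.
Image = Dict[str, float]
Captioner = Callable[[Image], str]


def classify_scene(image: Image) -> str:
    """Toy stand-in for the pre-classifier; returns 'indoor' or 'outdoor'.

    Assumption: a made-up 'sky_ratio' feature and threshold replace a
    trained image classifier.
    """
    return "outdoor" if image.get("sky_ratio", 0.0) > 0.3 else "indoor"


def make_captioner(scene: str) -> Captioner:
    """Toy stand-in for a captioning model trained on a single class."""
    def caption(image: Image) -> str:
        # A real model would decode a caption from image features.
        return f"<caption from the {scene} model>"
    return caption


# One captioner per class, mirroring training on indoor and outdoor subsets.
CAPTIONERS: Dict[str, Captioner] = {
    "indoor": make_captioner("indoor"),
    "outdoor": make_captioner("outdoor"),
}


def caption_with_preclassification(image: Image) -> str:
    """Route the image to the captioner that matches its predicted class."""
    scene = classify_scene(image)
    return CAPTIONERS[scene](image)
```

Calling `caption_with_preclassification({"sky_ratio": 0.6})` routes the image to the outdoor captioner, while an image without that feature falls back to the indoor one. An unclassified baseline would correspond to a single captioner trained on the whole dataset.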

American Psychological Association (APA)

Mualla, Rasha & al-Khayr, Jafar & Sulayman, Samir. 2021. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences, Vol. 37, no. 2, pp.111-124.
https://search.emarefa.net/detail/BIM-1417230

Modern Language Association (MLA)

al-Khayr, Jafar … [et al.]. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences, Vol. 37, no. 2 (2021), pp.111-124.
https://search.emarefa.net/detail/BIM-1417230

American Medical Association (AMA)

Mualla, Rasha & al-Khayr, Jafar & Sulayman, Samir. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences. 2021. Vol. 37, no. 2, pp.111-124.
https://search.emarefa.net/detail/BIM-1417230

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references: p. 123-124

Record ID

BIM-1417230