Using image pre-classification to improve the accuracy of the image captioning systems
Other Title(s)
استخدام التصنيف المسبق للصور لتحسين دقة أنظمة وصف الصور
Joint Authors
al-Khayr, Jafar
Sulayman, Samir
Mualla, Rasha
Source
Damascus University Journal of Engineering Sciences
Issue
Vol. 37, Issue 2 (30 Jun. 2021), pp.111-124, 14 p.
Publisher
Publication Date
2021-06-30
Country of Publication
Syria
No. of Pages
14
Main Subjects
Information Technology and Computer Science
Abstract EN
Deep learning for the purpose of image description and captioning has been one of the most promised computer science application recently.
it consists of two parts; the image and the text description models.
in previous researches, we studied the effect of using different languages and datasets on the image description models.
in this paper, we study the classification effect of the image dataset on those models.
so, a new combined 12000-images dataset consisting of two international datasets (Flickr2k and MS-COCO) is built.
the designed models support Arabic and English languages.
for the description part, we used two different scenarios.
in the first scenario, we used CNN and LSTM models.
while for the second one, resnet50 and fasttext are used as image and text models respectively.
the training is applied for both indoor and outdoor classes.
tests scenarios are applied in two cases and four ways which are the word-by-word and the sentence-by-sentence models.
the performance analysis proves that classified classes have a higher performance than the unclassified ones in case of repeating-based and non-repeating-based datasets in all scenarios.
American Psychological Association (APA)
Mualla, Rasha& al-Khayr, Jafar& Sulayman, Samir. 2021. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences،Vol. 37, no. 2, pp.111-124.
https://search.emarefa.net/detail/BIM-1417230
Modern Language Association (MLA)
al-Khayr, Jafar…[et al.]. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences Vol. 37, no. 2 (2021), pp.111-124.
https://search.emarefa.net/detail/BIM-1417230
American Medical Association (AMA)
Mualla, Rasha& al-Khayr, Jafar& Sulayman, Samir. Using image pre-classification to improve the accuracy of the image captioning systems. Damascus University Journal of Engineering Sciences. 2021. Vol. 37, no. 2, pp.111-124.
https://search.emarefa.net/detail/BIM-1417230
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references : p. 123-124
Record ID
BIM-1417230