Image-Text Joint Learning for Social Images with Spatial Relation Model

Joint Authors

Feng, Jiangfan
Fu, Xuejun
Zhou, Yao
Zhu, Yuling
Luo, Xiaobo

Source

Complexity

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-03-28

Country of Publication

Egypt

No. of Pages

11

Main Subjects

Philosophy

Abstract EN

The rapid developments in sensor technology and mobile devices bring a flourish of social images, and large-scale social images have attracted increasing attention to researchers.

Existing approaches generally rely on recognizing object instances individually with geo-tags, visual patterns, etc.

However, the social image represents a web of interconnected relations; these relations between entities carry semantic meaning and help a viewer differentiate between instances of a substance.

This article forms the perspective of the spatial relationship to exploring the joint learning of social images.

Precisely, the model consists of three parts: (a) a module for deep semantic understanding of images based on residual network (ResNet); (b) a deep semantic analysis module of text beyond traditional word bag methods; (c) a joint reasoning module from which the text weights obtained using image features on self-attention and a novel tree-based clustering algorithm.

The experimental results demonstrate the effectiveness of using Flickr30k and Microsoft COCO datasets.

Meanwhile, our method considers spatial relations while matching.

American Psychological Association (APA)

Feng, Jiangfan& Fu, Xuejun& Zhou, Yao& Zhu, Yuling& Luo, Xiaobo. 2020. Image-Text Joint Learning for Social Images with Spatial Relation Model. Complexity،Vol. 2020, no. 2020, pp.1-11.
https://search.emarefa.net/detail/BIM-1139917

Modern Language Association (MLA)

Feng, Jiangfan…[et al.]. Image-Text Joint Learning for Social Images with Spatial Relation Model. Complexity No. 2020 (2020), pp.1-11.
https://search.emarefa.net/detail/BIM-1139917

American Medical Association (AMA)

Feng, Jiangfan& Fu, Xuejun& Zhou, Yao& Zhu, Yuling& Luo, Xiaobo. Image-Text Joint Learning for Social Images with Spatial Relation Model. Complexity. 2020. Vol. 2020, no. 2020, pp.1-11.
https://search.emarefa.net/detail/BIM-1139917

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1139917