Text to Realistic Image Generation with Attentional Concatenation Generative Adversarial Networks

تاريخ النشر

2020-10-28

دولة النشر

مصر

عدد الصفحات

التخصصات الرئيسية

الرياضيات

الملخص EN

In this paper, we propose an Attentional Concatenation Generative Adversarial Network (ACGAN) aiming at generating 1024 × 1024 high-resolution images.

First, we propose a multilevel cascade structure, for text-to-image synthesis.

During training progress, we gradually add new layers and, at the same time, use the results and word vectors from the previous layer as inputs to the next layer to generate high-resolution images with photo-realistic details.

Second, the deep attentional multimodal similarity model is introduced into the network, and we match word vectors with images in a common semantic space to compute a fine-grained matching loss for training the generator.

In this way, we can pay attention to the fine-grained information of the word level in the semantics.

Finally, the measure of diversity is added to the discriminator, which enables the generator to obtain more diverse gradient directions and improve the diversity of generated samples.

The experimental results show that the inception scores of the proposed model on the CUB and Oxford-102 datasets have reached 4.48 and 4.16, improved by 2.75% and 6.42% compared to Attentional Generative Adversarial Networks (AttenGAN).

The ACGAN model has a better effect on text-generated images, and the resulting image is closer to the real image.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Li, Linyan& Sun, Yu& Hu, Fuyuan& Zhou, Tao& Xi, Xuefeng& Ren, Jinchang. 2020. Text to Realistic Image Generation with Attentional Concatenation Generative Adversarial Networks. Discrete Dynamics in Nature and Society،Vol. 2020, no. 2020, pp.1-10.
https://search.emarefa.net/detail/BIM-1153280

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Li, Linyan…[et al.]. Text to Realistic Image Generation with Attentional Concatenation Generative Adversarial Networks. Discrete Dynamics in Nature and Society No. 2020 (2020), pp.1-10.
https://search.emarefa.net/detail/BIM-1153280

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Li, Linyan& Sun, Yu& Hu, Fuyuan& Zhou, Tao& Xi, Xuefeng& Ren, Jinchang. Text to Realistic Image Generation with Attentional Concatenation Generative Adversarial Networks. Discrete Dynamics in Nature and Society. 2020. Vol. 2020, no. 2020, pp.1-10.
https://search.emarefa.net/detail/BIM-1153280

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1153280

حفظتم الحفظ طباعة

قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"

أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي

مرصد "معرفة"
لقياس الإنتاج العلمي العربي

تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر