Improved semantic inpainting architecture augmented with a facial landmark detector

Joint Authors

Sami, Mirza
Naiyer, Israt
Khan, Ehsanul
Uddin, Jia

Source

The International Arab Journal of Information Technology

Issue

Vol. 19, No. 3 (31 May 2022), pp. 353-362, 10 p.

Publisher

Zarqa University, Deanship of Scientific Research

Publication Date

2022-05-31

Country of Publication

Jordan

Number of Pages

10

Main Subjects

Information Technology and Computer Science

Abstract EN

This paper presents an augmented method for image completion, particularly for images of human faces, by leveraging deep-learning-based inpainting techniques.

Face completion generally tends to be a daunting task because of the relatively low uniformity of a face, which is attributed to structures like the eyes, nose, etc.

Here, understanding the top-level context is paramount for proper semantic completion.

The method presented improves upon existing inpainting techniques that reduce context difference by locating the closest encoding of the damaged image in the latent space of a pre-trained deep generator.
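The idea of locating the closest encoding in a generator's latent space can be illustrated with a toy sketch. Everything here is an assumption made for illustration: `toy_generator` is a hypothetical three-pixel stand-in for a pre-trained deep generator, the latent code is a single scalar, and the search uses plain gradient descent with a finite-difference gradient. It is not the paper's implementation.

```python
def toy_generator(z):
    """Hypothetical stand-in for a pre-trained generator: scalar code -> 3 pixels."""
    return [1.0 * z, 2.0 * z, 3.0 * z]


def masked_loss(z, damaged, mask):
    """Squared error between generated and damaged pixels, known pixels only (mask == 1)."""
    generated = toy_generator(z)
    return sum((g - d) ** 2 for g, d, m in zip(generated, damaged, mask) if m == 1)


def find_closest_code(damaged, mask, z=0.0, lr=0.05, steps=200, eps=1e-4):
    """Gradient descent on z using a central finite-difference gradient."""
    for _ in range(steps):
        grad = (masked_loss(z + eps, damaged, mask)
                - masked_loss(z - eps, damaged, mask)) / (2 * eps)
        z -= lr * grad
    return z


def complete(damaged, mask):
    """Keep known pixels; fill missing pixels from the generator output at the found code."""
    generated = toy_generator(find_closest_code(damaged, mask))
    return [d if m == 1 else g for d, g, m in zip(damaged, generated, mask)]


# Third pixel is missing (mask == 0); the first two are consistent with z = 2.
filled = complete([2.0, 4.0, 0.0], [1, 1, 0])
```

In this toy case the search recovers a code close to 2, so the missing pixel is filled with a value close to 6. A real generator would need a high-dimensional code and automatic differentiation.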

However, these existing methods fail to consider key facial structures (eyes, nose, jawline, etc.) and their locations relative to one another.

This paper mitigates this by introducing a face landmark detector and a corresponding landmark loss.

This landmark loss is added to the reconstruction loss between the damaged and generated images and to the adversarial loss of the generative model.

The model was trained on the CelebA dataset; tools such as pyamg, Pillow, and the OpenCV library were used for image manipulation and facial landmark detection.

There are three main weighted parameters that balance the effect of the three loss functions in this paper, namely context loss, landmark loss and prior loss.
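A minimal sketch of how three weighted loss terms might be combined is given below. The abstract does not state the exact form of each term or the weight values, so the function bodies and default weights are assumptions: the context loss is taken as the mean absolute difference over known pixels, the landmark loss as the mean squared distance between matching landmark coordinates, and the prior loss is passed in as an already computed number.

```python
def context_loss(damaged, generated, mask):
    """Assumed form: mean absolute difference over known pixels (mask == 1)."""
    diffs = [abs(d - g) for d, g, m in zip(damaged, generated, mask) if m == 1]
    return sum(diffs) / len(diffs)


def landmark_loss(landmarks_a, landmarks_b):
    """Assumed form: mean squared Euclidean distance between matching (x, y) points."""
    sq = [(xa - xb) ** 2 + (ya - yb) ** 2
          for (xa, ya), (xb, yb) in zip(landmarks_a, landmarks_b)]
    return sum(sq) / len(sq)


def total_loss(l_context, l_landmark, l_prior,
               w_context=1.0, w_landmark=1.0, w_prior=1.0):
    """Weighted sum of the three terms; the default weights are placeholders."""
    return w_context * l_context + w_landmark * l_landmark + w_prior * l_prior


# Example with flat pixel lists and two landmark points.
l_c = context_loss([0.0, 0.0, 0.0, 0.0], [1.0, 1.0, 1.0, 1.0], [1, 0, 1, 1])
l_l = landmark_loss([(0, 0), (1, 1)], [(3, 4), (1, 1)])
combined = total_loss(l_c, l_l, 4.0, w_context=1.0, w_landmark=0.5, w_prior=0.25)
```

Raising one weight makes the optimizer favour that term. For instance, a larger `w_landmark` would penalize misplaced facial structures more heavily relative to pixel agreement.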

Experimental results demonstrate that the added landmark loss contributes to a better understanding of top-level context, and hence the model can generate more visually appealing inpainted images than the existing model. The model obtained average Structural Similarity Index (SSIM) and Peak Signal-to-Noise Ratio (PSNR) scores of 0.851 and 33.448, respectively, for different orientations of the face, and 0.896 and 31.473, respectively, for various types of masks.
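For readers unfamiliar with the second metric, the following sketch computes PSNR from its standard definition, 10 · log10(MAX² / MSE), over flat lists of pixel values. The function name `psnr` and the default peak value of 255 (8-bit images) are assumptions; the abstract does not say how the reported scores were computed. SSIM is more involved and is usually taken from a library such as scikit-image (`skimage.metrics.structural_similarity`).

```python
import math


def psnr(reference, test, max_value=255.0):
    """Peak Signal-to-Noise Ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(test):
        raise ValueError("inputs must have the same number of pixels")
    # Mean squared error between corresponding pixels.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# A uniform error of 25.5 on an 8-bit scale gives MAX^2 / MSE = 100.
score = psnr([0.0, 0.0, 0.0, 0.0], [25.5, 25.5, 25.5, 25.5])
```

Higher values indicate a closer match to the reference image, and identical images give an infinite score.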

American Psychological Association (APA) citation style

Sami, Mirza & Naiyer, Israt & Khan, Ehsanul & Uddin, Jia. 2022. Improved semantic inpainting architecture augmented with a facial landmark detector. The International Arab Journal of Information Technology, Vol. 19, no. 3, pp. 353-362.
https://search.emarefa.net/detail/BIM-1437354

Modern Language Association (MLA) citation style

Sami, Mirza … [et al.]. Improved semantic inpainting architecture augmented with a facial landmark detector. The International Arab Journal of Information Technology Vol. 19, no. 3 (May 2022), pp. 353-362.
https://search.emarefa.net/detail/BIM-1437354

American Medical Association (AMA) citation style

Sami, Mirza & Naiyer, Israt & Khan, Ehsanul & Uddin, Jia. Improved semantic inpainting architecture augmented with a facial landmark detector. The International Arab Journal of Information Technology. 2022. Vol. 19, no. 3, pp. 353-362.
https://search.emarefa.net/detail/BIM-1437354

Data Type

Articles

Text Language

English

Record Number

BIM-1437354