Capsule GAN for robust face super resolution

PDF / 1,434,234 Bytes
14 Pages / 439.642 x 666.49 pts Page_size
82 Downloads / 214 Views

Capsule GAN for robust face super resolution Mahdiyar Molahasani Majdabadi1 · Seok-Bum Ko1,2 Received: 26 March 2020 / Revised: 5 July 2020 / Accepted: 28 July 2020 / © Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract Face hallucination is an emerging sub-field of Super-Resolution (SR) which aims to reconstruct the High-Resolution (HR) facial image given its Low-Resolution (LR) counterpart. The task becomes more challenging when the LR image is extremely small due to the image distortion in the super-resolved results. A variety of deep learning-based approaches has been introduced to address this issue by using attribute domain information. However, a more complex dataset or even further networks is required for training these models. In order to avoid these complexities and yet preserve the precision in reconstructed output, a robust Multi-Scale Gradient capsule GAN for face SR is proposed in this paper. A novel similarity metric called Feature SIMilarity (FSIM) is introduced as well. The proposed network surpassed state-of-the-art face SR systems in all metrics and demonstrates more robust performance while facing image transformations. Keywords Generative Adversarial Network (GAN) · Capsule network · Super resolution · Face hallucination

1 Introduction Face Super-Resolution(SR) is a fast-growing field that aims to enhance the resolution of facial images. These systems attempt to reconstruct High-Resolution (HR) face image from its Low-Resolution (LR) counterpart accurately. Due to the importance of facial details This work is the expansion of “MSG-CapsGAN: Multi-Scale gradient capsule GAN for face super-resolution,” in 2020 International Conference on Electronics, Information, and Communication (ICEIC), Barcelona, Spain, Jan. 2020. Seok-Bum Ko

[email protected] Mahdiyar Molahasani Majdabadi [email protected] 1

Department of Electrical and Computer Engineering, University of Saskatchewan, Saskatoon, Canada

2

Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, Canada

Multimedia Tools and Applications

on human perception, it is vital to preserve these facial details [3]. Face hallucination has widespread and crucial applications in various face-related systems such as face recognition, video surveillance system, and image editing [11]. To reconstruct HR image accurately, several challenges should be overcome. First, for large scale face SR, reconstructing an accurate HR image is an arduous task due to the lack of information in the LR input. Second, it is required that the HR image not only possesses similarity to the ground-truth but also has a photo-realistic appearance and seems natural. Finally, faces can appear in unlimited different poses. Hence, the facial SR system should be pose-invariant to generalize for various situations. There are two categories of learning-based SR systems, local patch-based methods and global methods. In the first category, the system is trained to reconstruct a patch of an image at a time. Rajput and Arya propose mi

Data Loading...

Capsule GAN for robust face super resolution

Recommend Documents

A robust face super-resolution algorithm and its application in low-resolution face recognition system

Fractal Residual Network for Face Image Super-Resolution

Super-Resolution for Iris

Universal super-resolution for face and non-face regions via a facial feature network

Face Super-Resolution Guided by 3D Facial Priors

Super-Resolution

An Improved Technique for Face Age Progression and Enhanced Super-Resolution with Generative Adversarial Networks

Fractional-Grey Wolf optimizer-based kernel weighted regression model for multi-view face video super resolution

Gradual deep residual network for super -resolution

A Lorentzian Stochastic Estimation for a Robust Iterative Multiframe Super-Resolution Reconstruction with Lorentzian-Tik

Robust Speaking Face Identification for Video Analysis

Structured Regularized Robust Coding for Face Recognition