Informative Multimodal Unsupervised Image-to-Image Translation

Tien Tai Doan; Guillaume Ghyselinck; Blaise Hanczar; Tien Tai Doan, Guillaume Ghyselinck and Blaise Hanczar; Tien Tai Doan; Guillaume Ghyselinck; Blaise Hanczar

doi:10.5121/csit.2021.110503

Volume 11, Number 05, April 2021

Informative Multimodal Unsupervised Image-to-Image Translation

Authors

Tien Tai Doan^1,2, Guillaume Ghyselinck¹ and Blaise Hanczar², ¹Dental Monitoring, France, ²University of Evry Val d’Essonne, France

Abstract

We propose a new method of multimodal image translation, called InfoMUNIT, which is an extension of the state-of-the-art method MUNIT. Our method allows controlling the style of the generated images and improves their quality and diversity. It learns to maximize the mutual information between a subset of style code and the distribution of the output images. Experiments show that our model cannot only translate one image from the source domain to multiple images in the target domain but also explore and manipulate features of the outputs without annotation. Furthermore, it achieves a superior diversity and a competitive image quality to state-of-the-art methods in multiple image translation tasks.

Keywords

Multimodal Image-to-Image Translation, Mutual Information, GANs, Manipulating Features, Disentangled Representation.

Subscription Membership AIRCC CSCP Contact Us
All Rights Reserved ® AIRCC