Toward Ordinal Image-to-Image Translation with Generative Adversarial Networks

University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

Author: David Stevens; [2021]

Keywords: ;

Abstract: The field of image-to-image translation consists of learning a transformation of an image from one domain to another. It has experienced great success during recent years, with methods being able to generate realistic outputs when converting between multiple categorical domains at once. However, existing approaches have not yet been extended to ordinal domains. Therefore, this thesis investigates how existing image-to-image translation methods can be extended to use ordinal labels and introduces the Ordinal GAN (OrGAN) architecture as one possible solution to the problem. OrGAN is based on two fundamental modifications to existing methods, namely, adding Gaussian noise to the labels of the data samples in each iteration of training and using a pre-trained embedding network to feed embedded labels into the network instead of scalars. The effectiveness of the model is demonstrated empirically in a variety of synthetic data sets and compared to a direct application of an established work in categorical image-to-image translation. This shows that the presented methodology is a suitable starting point for future work within the field. The full source code of the thesis is available at https://github.com/ stevensdavid/OrGAN 

  AT THIS PAGE YOU CAN DOWNLOAD THE WHOLE ESSAY. (follow the link to the next page)