Definition

StarGAN is a GAN model designed for multi-domain image-to-image translation tasks. Unlike previous models that required separate networks for each domain pair, StarGAN can perform image translations across multiple domains using a single generator network.
Architecture
The generator takes an input image and a target domain label to produce the translated image, and the discriminator network that not only distinguishes between real and fake images but also classifies the domain of the input image.
Mask Vector
StarGAN introduces a mask vector to handle datasets with partial domain labels, allowing it to ignore unspecified labels during training.
Objective Function
The objective function of StarGAN consists of three losses: adversarial loss, domain classification loss, and reconstruction loss. The adversarial loss ensures the generated images are realistic, the domain classification loss helps the model learn domain-specific features, and the reconstruction loss ensures the original image can be reconstructed when translating back to the source domain.
Adversarial loss: Domain classification loss: Reconstruction loss: where:
- is the generator
- is the discriminator
- is the discriminator’s source prediction
- is the discriminator’s domain classification
- is the distribution of the input data
- is the target domain label.
- is the original domain label.
The full objective function for StarGAN can be written as: where:
- and are the parameters of the generator and discriminator, respectively
- and are hyperparameters controlling the importance of each loss term