Definition

StarGAN is a generative adversarial network (GAN) designed for multi-domain image-to-image translation. Earlier approaches needed a separate generator for every ordered pair of domains, so covering $k$ domains required $k (k - 1)$ generators. StarGAN instead learns the mappings among all domains with a single generator network.

Architecture

StarGAN consists of two networks. The generator takes an input image and a target domain label and produces the translated image. The discriminator not only distinguishes real images from generated ones but also classifies the domain of its input image.
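One common way to feed the target domain label to the generator is to replicate it spatially and concatenate it with the image channels. The text above does not specify the mechanism, so the following is a minimal pure-Python sketch under that assumption. The helper name `concat_label` and the nested-list image layout are illustrative only.

```python
def concat_label(image, label):
    """Append one constant H x W plane per label entry to a C x H x W image.

    `image` is a nested list indexed as [channel][row][col]; `label` is a
    flat list such as a one-hot or binary attribute vector. Both names are
    hypothetical and not taken from any library.
    """
    height = len(image[0])
    width = len(image[0][0])
    # Each label entry becomes a constant plane with the image's spatial size.
    label_planes = [[[float(v)] * width for _ in range(height)] for v in label]
    # List concatenation builds a new list, so the input image is not mutated.
    return image + label_planes


# A 3-channel 2x2 toy "image" and a one-hot target label over 2 domains.
image = [[[0.1, 0.2], [0.3, 0.4]] for _ in range(3)]
target = [0, 1]
conditioned = concat_label(image, target)
print(len(conditioned))  # 5 channels: 3 image + 2 label
```

The generator then sees the label at every spatial position, which lets a fully convolutional network condition on the target domain without any extra input branch.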

Mask Vector

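StarGAN introduces a mask vector so that it can train jointly on several datasets whose label sets only partially overlap. The mask is a one-hot vector indicating which dataset an image comes from. It is appended to the concatenated label vectors, so the model can ignore the labels that are unknown for that image and focus on the labels its dataset actually provides.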
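The label layout can be sketched as follows: the label vectors of all datasets are concatenated, unknown ones are zero-filled, and a one-hot mask is appended. This is a minimal illustration, assuming two hypothetical datasets with 3 and 2 labels; the function name `unified_label` is not from any library.

```python
def unified_label(label, dataset_index, label_sizes):
    """Build [c_1, ..., c_n, m] where only dataset `dataset_index` is labeled."""
    if len(label) != label_sizes[dataset_index]:
        raise ValueError("label length does not match the dataset's label size")
    parts = []
    for i, size in enumerate(label_sizes):
        # Known labels are copied; labels of the other datasets are zero-filled.
        parts.extend(label if i == dataset_index else [0] * size)
    # The mask is a one-hot vector marking which dataset's labels are valid.
    mask = [1 if i == dataset_index else 0 for i in range(len(label_sizes))]
    return parts + mask


# Two hypothetical datasets: the first has 3 labels, the second has 2.
print(unified_label([1, 0, 1], 0, [3, 2]))  # [1, 0, 1, 0, 0, 1, 0]
print(unified_label([0, 1], 1, [3, 2]))     # [0, 0, 0, 0, 1, 0, 1]
```

During training, the classification loss would then be computed only on the label block selected by the mask.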

Objective Function

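The objective function of StarGAN combines three losses: an adversarial loss, a domain classification loss, and a reconstruction loss. The adversarial loss pushes generated images to be indistinguishable from real ones. The domain classification loss trains the discriminator to recognize the domain of real images and pushes the generator to produce images that are classified as the target domain. The reconstruction loss ensures that translating a generated image back to the source domain recovers the original image, so that translation changes only the domain-related parts of the input.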

Adversarial loss: $L_{adv} (G, D) = \mathbb{E}_{x \sim p_{r} (x)} [\ln D_{src} (x)] + \mathbb{E}_{x \sim p_{r} (x)} [\ln (1 - D_{src} (G (x, c)))]$

Domain classification loss: $L_{cls} (G, D) = \mathbb{E}_{x \sim p_{r} (x)} [- \ln D_{cls} (c' \mid x)] + \mathbb{E}_{x \sim p_{r} (x)} [- \ln D_{cls} (c \mid G (x, c))]$

Reconstruction loss: $L_{rec} (G) = \mathbb{E}_{x \sim p_{r} (x)} [\lVert x - G (G (x, c), c') \rVert_{1}]$

where:

$G$ is the generator
$D$ is the discriminator
$D_{src}$ is the discriminator’s source prediction
$D_{cls}$ is the discriminator’s domain classification
$p_{r}$ is the distribution of the input data
$c$ is the target domain label
$c'$ is the original domain label
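To make the three formulas concrete, the sketch below evaluates each of them on a toy batch with the standard library only. It assumes the discriminator outputs are already available as probabilities and that images are flattened into lists. All function names and numbers are illustrative. `classification_loss` computes a single term of $L_{cls}$, so it would be called once on real images with the original labels and once on generated images with the target labels.

```python
import math


def adversarial_loss(d_src_real, d_src_fake):
    """Batch mean of ln D_src(x) + ln(1 - D_src(G(x, c)))."""
    terms = [math.log(r) + math.log(1.0 - f)
             for r, f in zip(d_src_real, d_src_fake)]
    return sum(terms) / len(terms)


def classification_loss(probs, labels):
    """Batch mean of the negative log-probability of the given domain."""
    terms = [-math.log(p[k]) for p, k in zip(probs, labels)]
    return sum(terms) / len(terms)


def reconstruction_loss(originals, reconstructions):
    """Batch mean of the L1 distance between x and G(G(x, c), c')."""
    terms = [sum(abs(a - b) for a, b in zip(x, x_rec))
             for x, x_rec in zip(originals, reconstructions)]
    return sum(terms) / len(terms)


# Toy values, made up for illustration only.
l_adv = adversarial_loss([0.9, 0.8], [0.2, 0.1])
l_cls_real = classification_loss([[0.7, 0.3], [0.2, 0.8]], [0, 1])
l_rec = reconstruction_loss([[0.0, 1.0]], [[0.25, 0.5]])
print(l_adv, l_cls_real, l_rec)
```

Note that $L_{adv}$ is a sum of log-probabilities, so it is never positive. It approaches zero as the discriminator becomes confident and correct on both real and generated images.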

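The full objective is often summarized as a single min-max problem: $\min_{θ_{G}} \max_{θ_{D}} L = L_{adv} (G, D) + λ_{cls} L_{cls} (G, D) + λ_{rec} L_{rec} (G)$. This compact form is exact only for the adversarial term, because the discriminator minimizes (rather than maximizes) its classification loss on real images. Writing $L_{cls}^{r}$ and $L_{cls}^{f}$ for the real-image and generated-image terms of $L_{cls}$, the discriminator minimizes $L_{D} = - L_{adv} + λ_{cls} L_{cls}^{r}$ and the generator minimizes $L_{G} = L_{adv} + λ_{cls} L_{cls}^{f} + λ_{rec} L_{rec}$, where: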

$θ_{G}$ and $θ_{D}$ are the parameters of the generator and discriminator, respectively
$λ_{cls}$ and $λ_{rec}$ are hyperparameters controlling the importance of each loss term
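In practice the objective is usually split into one loss per network, each minimized in turn: the discriminator minimizes the negated adversarial loss plus the classification loss on real images, and the generator minimizes the adversarial loss plus the classification loss on generated images and the reconstruction loss. The sketch below shows this weighting on scalar loss values. The function names are hypothetical, and the defaults $λ_{cls} = 1$ and $λ_{rec} = 10$ follow the values reported in the StarGAN paper.

```python
def discriminator_loss(l_adv, l_cls_real, lambda_cls=1.0):
    """Loss minimized by D: maximize L_adv and classify real images correctly."""
    return -l_adv + lambda_cls * l_cls_real


def generator_loss(l_adv, l_cls_fake, l_rec, lambda_cls=1.0, lambda_rec=10.0):
    """Loss minimized by G: fool D, reach the target domain, stay reconstructable."""
    return l_adv + lambda_cls * l_cls_fake + lambda_rec * l_rec


# Toy scalar loss values, chosen only for illustration.
print(discriminator_loss(l_adv=-0.5, l_cls_real=0.25))          # 0.75
print(generator_loss(l_adv=-0.5, l_cls_fake=0.5, l_rec=0.125))  # 1.25
```

A larger $λ_{rec}$ makes the generator favor preserving the input content over changing the domain, while a larger $λ_{cls}$ does the opposite.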
