Definition

PixelRNN assumes that images are sampled from an unknown distribution. The model generate images pixel by pixel, treating the image as a sequence. Each pixel is conditioned on all previously generated pixels. The model is trained to maximize the log-likelihood of the training data.
Architecture
The model uses a multi-layer ConvLSTM