Sources: Neil Rhodes CS 152 NN—16 GANs: Mode Collapse, Understanding GANs.

Related to GANs.

As an introduction: a GAN plays a minimax game between two networks.

The Generator takes random noise $z \sim p_{z}(z)$ and outputs a fake sample $G(z)$. Its goal is to produce fakes that look real.
The Discriminator takes a sample $x$ and outputs the probability that $x$ is real, $D(x) \in [0, 1]$. Its goal is to distinguish real data from fake data.
They are trained simultaneously and in opposition, which is called adversarial training.

$$L_{GAN}(D, G) = \mathbb{E}_{x}[\log D(x)] + \mathbb{E}_{z}[\log(1 - D(G(z)))]$$

$$G^{*} = \arg\min_{G} \max_{D} L_{GAN}(D, G)$$

G minimizes: it pushes $D(G(z)) \to 1$, fooling the discriminator, which drives $\log(1 - D(G(z))) \to -\infty$.
D maximizes: it correctly identifies real samples ($D(x) \to 1$, so $\log D(x) \to 0$) and fakes ($D(G(z)) \to 0$, so $\log(1 - D(G(z))) \to 0$).
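
To make the objective concrete, here is a minimal sketch that estimates the value function from a handful of discriminator outputs. The function name `gan_value` and the example probabilities are made up for illustration; in a real GAN these numbers would come from the trained networks.

```python
import math

def gan_value(d_real, d_fake):
    """Monte Carlo estimate of L_GAN(D, G).

    d_real: discriminator outputs D(x) on real samples, each in (0, 1)
    d_fake: discriminator outputs D(G(z)) on generated samples, each in (0, 1)
    """
    real_term = sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term

# Hypothetical outputs: D separates real from fake well, so the value is near 0.
confident = gan_value(d_real=[0.99, 0.98], d_fake=[0.01, 0.02])

# Hypothetical outputs: D is fooled (D(G(z)) near 1), so the value is very negative.
fooled = gan_value(d_real=[0.99, 0.98], d_fake=[0.99, 0.98])

# D outputs 0.5 everywhere: the value is 2 * log(0.5) = -log(4).
unsure = gan_value(d_real=[0.5, 0.5], d_fake=[0.5, 0.5])
```

The ordering `confident > unsure > fooled` matches the two roles: D wants the value as high as possible, G wants it as low as possible.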

Mode Collapse

Given a Generator $G$ and a wide range of noise inputs $z$:

The Generator ends up outputting only one class, or something close to one class.
So its expressivity is rather limited: in the extreme case, it maps every noise vector to the same point in data space.

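The training process allows mode collapse to occur: the objective only rewards the Generator for fooling the Discriminator on each sample, not for covering every mode of the real data.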
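
A toy illustration of what collapse looks like, assuming 1-D real data with three modes. Everything here is hypothetical (the mode centers, both hand-written generators, and the `modes_covered` helper); nothing is trained, it only shows how a collapsed generator covers fewer modes than a diverse one given the same noise.

```python
import random

MODES = [0.0, 5.0, 10.0]  # hypothetical centers of the real data's modes

def collapsed_generator(z):
    # Ignores almost all of the noise: every z lands next to the mode at 5.0.
    return 5.0 + 0.01 * z

def diverse_generator(z):
    # Uses the noise to pick a mode, so different z reach different modes.
    if z < -0.43:
        return MODES[0] + 0.01 * z
    if z < 0.43:
        return MODES[1] + 0.01 * z
    return MODES[2] + 0.01 * z

def modes_covered(generator, n_samples=1000, tol=0.5, seed=0):
    """Count how many real modes have at least one generated sample within tol."""
    rng = random.Random(seed)
    samples = [generator(rng.gauss(0.0, 1.0)) for _ in range(n_samples)]
    return sum(any(abs(s - m) < tol for s in samples) for m in MODES)

collapsed_count = modes_covered(collapsed_generator)
diverse_count = modes_covered(diverse_generator)
```

Each individual sample from `collapsed_generator` sits right on a real mode, so a discriminator judging samples one at a time has little to object to. The problem only shows up when looking at many samples together.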

How to address this?

One technique is called Minibatch Discrimination. Basically, it gives the discriminator information about the other samples in the batch as it evaluates each individual sample. This way, the discriminator can learn to flag samples as generated when they all happen to be very close to one another.
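
A simplified sketch of the idea, not the full technique: the published version first projects features through a learned tensor, which is skipped here. For each sample, the hypothetical helper `minibatch_closeness` sums $\exp(-\text{L1 distance})$ to every other sample in the batch, and `augment` appends that number as an extra feature the discriminator could use. The example batches are made up.

```python
import math

def minibatch_closeness(batch):
    """For each sample, sum exp(-L1 distance) to every other sample in the batch."""
    feats = []
    for i, xi in enumerate(batch):
        total = 0.0
        for j, xj in enumerate(batch):
            if i != j:
                l1 = sum(abs(a - b) for a, b in zip(xi, xj))
                total += math.exp(-l1)
        feats.append(total)
    return feats

def augment(batch):
    # Append the closeness feature so a discriminator sees batch-level context.
    return [list(x) + [c] for x, c in zip(batch, minibatch_closeness(batch))]

# Hypothetical batches: near-duplicates (collapsed) vs. well-separated samples.
collapsed_batch = [[1.0, 1.0], [1.0, 1.01], [1.01, 1.0], [1.0, 1.0]]
spread_batch = [[0.0, 0.0], [5.0, 1.0], [-3.0, 4.0], [2.0, -6.0]]

collapsed_feats = minibatch_closeness(collapsed_batch)
spread_feats = minibatch_closeness(spread_batch)
```

The closeness feature is close to 3 (the number of other samples) for every point in the collapsed batch and close to 0 for the spread batch, so it gives the discriminator a direct signal that a batch is suspiciously uniform.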

Another solution is Wasserstein GANs (paper link).
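
A rough sketch of how the losses change, as I understand the original weight-clipping formulation of WGAN: the discriminator becomes a critic that outputs an unbounded score instead of a probability, and the log terms disappear. The function names and the flat list of weights are assumptions for illustration; a real implementation would operate on network parameters.

```python
def critic_loss(real_scores, fake_scores):
    """Loss the critic minimizes: mean score on fakes minus mean score on reals."""
    mean_fake = sum(fake_scores) / len(fake_scores)
    mean_real = sum(real_scores) / len(real_scores)
    return mean_fake - mean_real

def generator_loss(fake_scores):
    """Loss the generator minimizes: it wants the critic to score fakes highly."""
    return -sum(fake_scores) / len(fake_scores)

def clip_weights(weights, c=0.01):
    # Keep every critic weight in [-c, c] after each update (weight clipping).
    return [max(-c, min(c, w)) for w in weights]

# Hypothetical critic scores and weights.
example_critic_loss = critic_loss(real_scores=[2.0, 4.0], fake_scores=[1.0, 1.0])
example_generator_loss = generator_loss(fake_scores=[1.0, 3.0])
example_clipped = clip_weights([0.5, -0.5, 0.005])
```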

🚀 Costin Chitic
