ONE WEEK ATAL FDP
Convolutional Neural Networks with Generative Adversarial Networks

Conceptual View of Generative Adversarial Networks

13th December 2023 (Wednesday)

Organized by
Department of Information Technology
Sir C R Reddy College of Engineering,
Eluru, Andhra Pradesh, India.

Dr. Rajkumar S
School of Computer Science and Engineering
Vellore Institute of Technology (Vellore)
Tamil Nadu, India – 632 014.
rajkumarsrajkumar@[Link]
[Link]/view/rajkumars1987
Presentation Outline
Introduction to GANs
Some Challenges with GANs
Applications of GAN
Advanced GAN Extensions
Demo
Introduction to GAN

"This (GANs), and the variations that are now being proposed, is the most interesting idea in the last 10 years in ML, in my opinion."

– Yann LeCun
Introduction to GAN

GANs were first introduced by Ian Goodfellow et al. in 2014.

They have been used to generate images, videos, poems, and some simple conversations.

Note: image processing is comparatively easy (all animals can do it), while NLP is hard (only humans can do it).

Ian Goodfellow: [Link]
Radford (voice generation examples): [Link]
Tips for training GANs: [Link]
WHAT ARE GANS?

› Generative Adversarial Networks

Generative Models
We try to learn the underlying distribution from which our dataset comes.
Eg: Variational AutoEncoders (VAE)

Adversarial Training
GANs are made up of two competing networks (adversaries) that are trying to beat each other.

Neural Networks
Both the Generator and the Discriminator are neural networks.
WHAT ARE GANS?

[Architecture diagram: a noise sample from P(z) is fed to the Generator, which produces Generated Data; the Discriminator receives both the Generated Data and Real Data and outputs Real/Fake.]
HOW TO TRAIN A GAN?

At t = 0:
[Diagram: a latent vector is fed to the Generator, which produces a generated image (fake data); the Discriminator, a binary classifier, receives both the generated data and the given training data (real data) and outputs Real/Fake.]
HOW TO TRAIN A GAN?

› Which network should I train first?
› The Discriminator!
› But with what training data?
› The Discriminator is a binary classifier with two classes: Real and Fake.
› The data for the Real class is already given: THE TRAINING DATA.
› The data for the Fake class? -> Generate it from the Generator.
HOW TO TRAIN A GAN?

› What's next? -> Train the Generator.
› But how? What's our training objective?
› Generate images from the Generator such that they are classified incorrectly by the Discriminator!
HOW TO TRAIN A GAN?

Step 1: Train the Discriminator using the current ability of the Generator.
Step 2: Train the Generator to beat the Discriminator, i.e. to generate images that the Discriminator classifies incorrectly.
Alternate between the two steps until training converges.
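The alternating procedure above can be sketched end to end on a one-dimensional toy problem. This is an illustrative sketch only: the "Generator" is a scalar affine map, the "Discriminator" is logistic regression, the gradients are derived by hand, and the real data distribution N(3, 1) is an assumption of the example.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda t: 1.0 / (1.0 + np.exp(-np.clip(t, -60, 60)))

# Toy setup: real data ~ N(3, 1); Generator G(z) = a*z + b with z ~ N(0, 1);
# Discriminator D(x) = sigmoid(w*x + c). All parameters are scalars.
a, b = 1.0, 0.0            # Generator parameters
w, c = 0.1, 0.0            # Discriminator parameters
lr, batch = 0.05, 64

for step in range(3000):
    # Step 1: train the Discriminator on real vs. fake samples.
    x_real = rng.normal(3.0, 1.0, batch)
    x_fake = a * rng.normal(0.0, 1.0, batch) + b
    d_real = sigmoid(w * x_real + c)
    d_fake = sigmoid(w * x_fake + c)
    # Gradient of -[mean log D(x) + mean log(1 - D(G(z)))]:
    gw = -np.mean((1 - d_real) * x_real) + np.mean(d_fake * x_fake)
    gc = -np.mean(1 - d_real) + np.mean(d_fake)
    w, c = w - lr * gw, c - lr * gc
    # Step 2: train the Generator to fool the (frozen) Discriminator.
    z = rng.normal(0.0, 1.0, batch)
    d_fake = sigmoid(w * (a * z + b) + c)
    # Non-saturating objective: minimize -mean log D(G(z)).
    ga = -np.mean((1 - d_fake) * w * z)
    gb = -np.mean((1 - d_fake) * w)
    a, b = a - lr * ga, b - lr * gb

samples = a * rng.normal(0.0, 1.0, 10000) + b
print(round(float(samples.mean()), 2))  # drifts toward the real mean of 3
```

The generated mean starts at 0 and drifts toward the real mean, which is exactly what Step 2 is supposed to achieve.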
Why Generative Models?
• We’ve only seen discriminative models so far
• Given an image X, predict a label Y
• Estimates P(Y|X)
• Discriminative models have several key limitations
• Can’t model P(X), i.e. the probability of seeing a certain image
• Thus, can’t sample from P(X), i.e. can’t generate new images
• Generative models (in general) cope with all of the above
• Can model P(X)
• Can generate new images
Magic of GANs…
Lotter, William, Gabriel Kreiman, and David Cox. "Unsupervised learning of visual structure using predictive generative networks." arXiv preprint arXiv:1511.06380 (2015).
Magic of GANs…
Which one is Computer generated?
Ledig, Christian, et al. "Photo-realistic single image super-resolution using a generative adversarial network." arXiv preprint arXiv:1609.04802 (2016).
Magic of GANs…
[Link]
Adversarial Training
• We saw:
• We can generate adversarial samples to fool a discriminative model
• We can use those adversarial samples to make models robust
• We then require more effort to generate adversarial samples
• Repeating this gives us a better discriminative model
• GANs extend that idea to generative models:
• Generator: generates fake samples, tries to fool the Discriminator
• Discriminator: tries to distinguish between real and fake samples
• Train them against each other
• Repeating this gives us a better Generator and Discriminator
GAN Architecture

[Architecture diagram: noise z is fed to the Generator G to produce G(z); the Discriminator D outputs D(x) on real data x and D(G(z)) on generated data.]

• z is some random noise (Gaussian/Uniform).
• z can be thought of as the latent representation of the image.

[Link]
Training Discriminator
[Link]
Training Generator
[Link]
GAN Formulation

min_G max_D V(D, G)

• It is formulated as a minimax game, where:
• The Discriminator is trying to maximize its reward V(D, G)
• The Generator is trying to minimize the Discriminator's reward (or maximize its loss)

V(D, G) = E_{x∼p(x)}[log D(x)] + E_{z∼q(z)}[log(1 − D(G(z)))]

• The Nash equilibrium of this particular game is achieved at:
• P_data(x) = P_gen(x) ∀x
• D(x) = 1/2 ∀x
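The value function can be evaluated numerically by replacing the expectations with batch means over Discriminator outputs. The helper below is a sketch; the function name and the clipping (to avoid log(0)) are our own additions.

```python
import numpy as np

# Approximate V(D, G) from batches of Discriminator outputs.
def gan_value(d_real, d_fake, eps=1e-12):
    d_real = np.clip(d_real, eps, 1 - eps)   # D(x) on real samples
    d_fake = np.clip(d_fake, eps, 1 - eps)   # D(G(z)) on generated samples
    return np.mean(np.log(d_real)) + np.mean(np.log(1 - d_fake))

# At the Nash equilibrium D(x) = 1/2 everywhere, so V = 2 * log(1/2):
v_eq = gan_value(np.full(4, 0.5), np.full(4, 0.5))
print(v_eq)  # -2 log 2 ≈ -1.386
```

A Discriminator that confidently separates real from fake (e.g. D(x) = 0.9, D(G(z)) = 0.1) yields a larger V, consistent with D being the maximizing player.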
[Figures: Discriminator updates; Generator updates.]
Vanishing gradient strikes back again…

min_G max_D V(D, G)

V(D, G) = E_{x∼p(x)}[log D(x)] + E_{z∼q(z)}[log(1 − D(G(z)))]

∇_{θ_G} V(D, G) = ∇_{θ_G} E_{z∼q(z)}[log(1 − D(G(z)))]

• With a the Discriminator's logit, ∇_a log(1 − σ(a)) = −∇_a σ(a) / (1 − σ(a)) = −σ(a)(1 − σ(a)) / (1 − σ(a)) = −σ(a) = −D(G(z))
• The gradient goes to 0 if D is confident, i.e. D(G(z)) → 0
• Minimize −E_{z∼q(z)}[log D(G(z))] for the Generator instead (keep the Discriminator as it is)
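The two Generator objectives can be compared directly as functions of the Discriminator's logit a for a generated sample (so D(G(z)) = sigmoid(a)); the derivative formulas below are the standard ones from the derivation above.

```python
import math

sigmoid = lambda a: 1.0 / (1.0 + math.exp(-a))

def grad_saturating(a):
    # d/da log(1 - sigmoid(a)) = -sigmoid(a): vanishes as D(G(z)) -> 0
    return -sigmoid(a)

def grad_non_saturating(a):
    # d/da log(sigmoid(a)) = 1 - sigmoid(a): stays near 1 when D(G(z)) -> 0
    return 1.0 - sigmoid(a)

a = -10.0                          # a confident Discriminator: D(G(z)) ≈ 4.5e-5
print(grad_saturating(a))          # ≈ -4.5e-5: almost no learning signal
print(grad_non_saturating(a))      # ≈ 1.0: strong learning signal
```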
Faces
Goodfellow, Ian, et al. "Generative adversarial nets." Advances in Neural Information Processing Systems. 2014.
DCGAN: Bedroom Images

Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." arXiv:1511.06434 (2015).
Deep Convolutional GANs (DCGANs)

Key ideas:
• Replace FC hidden layers with convolutions
• Generator: fractional-strided convolutions
• Use Batch Normalization after each layer
• Inside the Generator:
• Use ReLU for hidden layers
• Use Tanh for the output layer

[Figure: Generator architecture.]

Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." arXiv:1511.06434 (2015).
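As a quick sanity check of the fractional-strided upsampling path, the snippet below walks the standard transposed-convolution output-size formula through the paper's generator (kernel 4, stride 2, padding 1): each layer doubles the spatial size, from a 4×4×1024 tensor up to a 64×64×3 image. The helper name is ours.

```python
# Standard output-size rule for a transposed convolution.
def conv_transpose_out(size, kernel=4, stride=2, pad=1):
    return (size - 1) * stride - 2 * pad + kernel

size, channels = 4, [1024, 512, 256, 128, 3]
shapes = [(channels[0], size, size)]
for ch in channels[1:]:
    size = conv_transpose_out(size)   # 4 -> 8 -> 16 -> 32 -> 64
    shapes.append((ch, size, size))
for s in shapes:
    print(s)  # (1024, 4, 4) ... (3, 64, 64)
```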
Latent vectors capture
interesting patterns…
Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." arXiv:1511.06434 (2015).
Advantages of GANs
• Plenty of existing work on Deep Generative Models
• Boltzmann Machine
• Deep Belief Nets
• Variational AutoEncoders (VAE)
• Why GANs?
• Sampling (or generation) is straightforward.
• Training doesn't involve Maximum Likelihood estimation.
• Robust to overfitting, since the Generator never sees the training data.
• Empirically, GANs are good at capturing the modes of the distribution.
Goodfellow, Ian. "NIPS 2016 Tutorial: Generative Adversarial Networks." arXiv preprint arXiv:1701.00160 (2016).
Problems with GANs
• Probability Distribution is Implicit
• Not straightforward to compute P(X).
• Thus Vanilla GANs are only good for Sampling/Generation.
• Training is Hard
• Non-Convergence
• Mode-Collapse
Goodfellow, Ian. "NIPS 2016 Tutorial: Generative Adversarial Networks." arXiv preprint arXiv:1701.00160 (2016).
Training Problems
• Non-Convergence
• Mode-Collapse
• Deep Learning models (in general) involve a single player
• The player tries to maximize its reward (minimize its loss).
• Use SGD (with Backpropagation) to find the optimal parameters.
• SGD has convergence guarantees (under certain conditions).
• Problem: With non-convexity, we might converge to local optima.
min_G L_G
• GANs instead involve two (or more) players
• Discriminator is trying to maximize its reward.
• Generator is trying to minimize Discriminator’s reward.
min_G max_D V(D, G)
• SGD was not designed to find the Nash equilibrium of a game.
• Problem: We might not converge to the Nash equilibrium at all.
Salimans, Tim, et al. "Improved techniques for training GANs." Advances in Neural Information Processing Systems. 2016.
Non-Convergence

min_x max_y V(x, y), with V(x, y) = xy

• State 1: x > 0, y > 0, V > 0 → increase y, decrease x
• State 2: x < 0, y > 0, V < 0 → decrease y, decrease x
• State 3: x < 0, y < 0, V > 0 → decrease y, increase x
• State 4: x > 0, y < 0, V < 0 → increase y, increase x
• State 5: x > 0, y > 0, V > 0 == State 1 → increase y, decrease x
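The cycle through States 1–5 can be reproduced with a few lines of simultaneous gradient updates: the minimizer takes x ← x − lr·y, the maximizer takes y ← y + lr·x. The unique equilibrium is (0, 0), yet with any finite step size the iterates spiral outward instead of converging.

```python
# Simultaneous gradient play on V(x, y) = x*y.
def play(x, y, lr=0.1, steps=200):
    radii = []
    for _ in range(steps):
        x, y = x - lr * y, y + lr * x   # one simultaneous gradient step
        radii.append((x * x + y * y) ** 0.5)
    return radii

radii = play(1.0, 1.0)
print(radii[0], radii[-1])  # distance from the equilibrium (0, 0) grows
```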
Mode-Collapse
• Generator fails to output diverse samples
[Figure: target distribution, expected output, and actual generator output, illustrating mode collapse.]
Metz, Luke, et al. "Unrolled Generative Adversarial Networks." arXiv preprint arXiv:1611.02163 (2016).
Some Solutions
• Mini-Batch GANs
• Supervision with labels
Basic (Heuristic) Solutions
• Mini-Batch GANs
• Supervision with labels
How to reward sample diversity?
• At Mode Collapse,
• The Generator produces good samples, but only a few of them.
• Thus, the Discriminator can't tag them as fake.
• To address this problem,
• Let the Discriminator know about this edge case.
• More formally,
• Let the Discriminator look at the entire batch instead of single examples.
• If there is a lack of diversity, it will mark the examples as fake.
• Thus,
• The Generator will be forced to produce diverse samples.
Salimans, Tim, et al. "Improved techniques for training GANs." Advances in Neural Information Processing Systems. 2016.
Mini-Batch GANs
• Extract features that capture diversity in the mini-batch
• e.g. the L2 norm of the difference between all pairs from the batch
• Feed those features to the Discriminator along with the image
• Feature values will differ between diverse and non-diverse batches
• Thus, the Discriminator will rely on those features for classification
• This, in turn,
• Will force the Generator to match those feature values with the real data
• Will generate diverse batches
Salimans, Tim, et al. "Improved techniques for training GANs." Advances in Neural Information Processing Systems. 2016.
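A hypothetical sketch of such a diversity feature: for each sample, the mean L2 distance to the other samples in its batch. A mode-collapsed batch (near-identical samples) yields near-zero features that a Discriminator can learn to flag as fake.

```python
import numpy as np

def diversity_features(batch):
    diffs = batch[:, None, :] - batch[None, :, :]   # all pairwise differences
    dists = np.linalg.norm(diffs, axis=-1)          # (n, n) L2 distances
    return dists.sum(axis=1) / (len(batch) - 1)     # mean distance to others

rng = np.random.default_rng(0)
diverse = rng.normal(size=(8, 16))                       # varied samples
collapsed = np.tile(rng.normal(size=(1, 16)), (8, 1))    # collapsed batch
print(diversity_features(diverse).mean())    # clearly positive
print(diversity_features(collapsed).mean())  # ~0
```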
Basic (Heuristic) Solutions
• Mini-Batch GANs
• Supervision with labels
Supervision with Labels

• Label information of the real data might help.

[Figure: a standard Discriminator D outputs Real/Fake; a label-supervised Discriminator D outputs Car/Dog/Human/…/Fake.]

• Empirically, this generates much better samples.

Salimans, Tim, et al. "Improved techniques for training GANs." Advances in Neural Information Processing Systems. 2016.
Alternate View of GANs

min_G max_D V(D, G)

V(D, G) = E_{x∼p(x)}[log D(x)] + E_{z∼q(z)}[log(1 − D(G(z)))]

D* = argmax_D V(D, G)        G* = argmin_G V(D, G)

• In this formulation, the Discriminator's strategy was D(x) → 1, D(G(z)) → 0.
• Alternatively, we can flip the binary classification labels, i.e. Fake = 1, Real = 0:

V(D, G) = E_{x∼p(x)}[log(1 − D(x))] + E_{z∼q(z)}[log D(G(z))]

• In this new formulation, the Discriminator's strategy will be D(x) → 0, D(G(z)) → 1.

Zhao, Junbo, Michael Mathieu, and Yann LeCun. "Energy-based generative adversarial network." arXiv preprint arXiv:1609.03126 (2016).
Alternate View of GANs (Contd.)

• If all we want to encode is D(x) → 0, D(G(z)) → 1:

D* = argmax_D E_{x∼p(x)}[log(1 − D(x))] + E_{z∼q(z)}[log D(G(z))]

We can use this:

D* = argmin_D E_{x∼p(x)}[log D(x)] + E_{z∼q(z)}[log(1 − D(G(z)))]

• Now we can replace the cross-entropy with any loss function (e.g. the Hinge Loss):

D* = argmin_D E_{x∼p(x)}[D(x)] + E_{z∼q(z)}[max(0, m − D(G(z)))]

• And thus, instead of outputting probabilities, the Discriminator just has to output:
• High values for fake samples
• Low values for real samples

Zhao, Junbo, Michael Mathieu, and Yann LeCun. "Energy-based generative adversarial network." arXiv preprint arXiv:1609.03126 (2016).
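A sketch of the hinge-style Discriminator objective above: D outputs an unbounded score that should be low on real samples and at least the margin m on fakes. The function name is ours.

```python
def hinge_d_loss(d_real_scores, d_fake_scores, m=1.0):
    real_term = sum(d_real_scores) / len(d_real_scores)
    fake_term = sum(max(0.0, m - s) for s in d_fake_scores) / len(d_fake_scores)
    return real_term + fake_term

# A Discriminator that scores real low and fake high pays no hinge penalty:
good = hinge_d_loss([-2.0, -3.0], [2.0, 3.0])
# One that scores them the other way around pays a large penalty:
bad = hinge_d_loss([2.0, 3.0], [-2.0, -3.0])
print(good, bad)  # -2.5 6.0
```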
Energy-Based GANs

• Modified game plan:
• The Generator will try to generate samples with low values D(x).
• The Discriminator will try to assign high scores to fake samples.
• Use an AutoEncoder inside the Discriminator:

D(x) = ||Dec(Enc(x)) − x||_MSE

• Use the Mean-Squared Reconstruction Error as D(x):
• High reconstruction error for fake samples
• Low reconstruction error for real samples

Zhao, Junbo, Michael Mathieu, and Yann LeCun. "Energy-based generative adversarial network." arXiv preprint arXiv:1609.03126 (2016).
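A toy sketch of the EBGAN energy: D(x) is an autoencoder's reconstruction error. Here the "autoencoder" is a PCA-style linear projection fitted to the real data (an assumption of the example; the paper uses a trained network), so on-manifold real samples reconstruct well (low energy) while off-manifold fakes reconstruct badly (high energy).

```python
import numpy as np

rng = np.random.default_rng(0)
basis = rng.normal(size=(8, 2))
real = rng.normal(size=(256, 2)) @ basis.T      # real data on a 2-D plane in R^8
U, _, _ = np.linalg.svd(real.T @ real)
enc = U[:, :2]                                  # encoder: top-2 principal axes

def energy(x):
    recon = (x @ enc) @ enc.T                   # Dec(Enc(x))
    return np.mean((recon - x) ** 2, axis=-1)   # per-sample MSE, i.e. D(x)

fake = rng.normal(size=(256, 8))                # off-manifold "generated" data
print(energy(real).mean(), energy(fake).mean()) # low for real, high for fake
```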
More Bedrooms…
Zhao, Junbo, Michael Mathieu, and Yann LeCun. "Energy-basedgenerative adversarial network." arXiv preprint arXiv:1609.03126 (2016)
More Celebs…
GAN Applications

› Image-to-Image Translation
› Text-to-Image Synthesis
› Face Aging
Image-to-Image Translation
Figure 1 in the original paper.
Link to an interactive demo of this paper
Isola, P., Zhu, J. Y., Zhou, T., & Efros, A. A. "Image-to-image translation with conditional adversarial networks". arXiv preprint arXiv:1611.07004 (2016).
Image-to-Image Translation

• Architecture: DCGAN-based architecture.
• Training is conditioned on the images from the source domain.
• Conditional GANs provide an effective way to handle many complex domains without worrying about designing structured loss functions explicitly.

Figure 2 in the original paper.

Isola, P., Zhu, J. Y., Zhou, T., & Efros, A. A. "Image-to-image translation with conditional adversarial networks". arXiv preprint arXiv:1611.07004 (2016).
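A sketch of how the conditioning works in this pix2pix-style setup: the source-domain image is concatenated channel-wise with the generated (or real) target-domain image before being fed to the Discriminator, so Real/Fake is judged for the pair. The shapes below are arbitrary illustrative choices.

```python
import numpy as np

src = np.zeros((1, 3, 256, 256))   # source-domain image (e.g. an edge map)
gen = np.zeros((1, 3, 256, 256))   # Generator output in the target domain
disc_input = np.concatenate([src, gen], axis=1)  # channel-wise conditioning
print(disc_input.shape)            # (1, 6, 256, 256)
```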
Text-to-Image Synthesis

Motivation:
Given a text description, generate closely associated images.

Uses a conditional GAN, with the generator and discriminator being conditioned on a "dense" text embedding.

Figure 1 in the original paper.

Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., & Lee, H. "Generative adversarial text to image synthesis". ICML (2016).
Text-to-Image Synthesis
Figure 2 in the original paper.
Positive Example:
• Real Image, Right Text
Negative Examples:
• Real Image, Wrong Text
• Fake Image, Right Text
Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., & Lee, H. "Generative adversarial text to image synthesis". ICML (2016).
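The three training pairs above can be written out as Discriminator targets: only a real image paired with its matching text counts as a positive example. The helper name is ours.

```python
def discriminator_target(image_is_real, text_matches):
    return 1.0 if (image_is_real and text_matches) else 0.0

pairs = [
    (True, True),    # real image, right text -> positive
    (True, False),   # real image, wrong text -> negative
    (False, True),   # fake image, right text -> negative
]
print([discriminator_target(i, t) for i, t in pairs])  # [1.0, 0.0, 0.0]
```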
Face Aging with Conditional GANs

• Differentiating feature: uses an Identity Preservation Optimization with an auxiliary network to get a better approximation of the latent code (z*) for an input image.
• The latent code is then conditioned on a discrete (one-hot) embedding of age categories.

Figure 1 in the original paper.

Antipov, G., Baccouche, M., & Dugelay, J. L. (2017). "Face Aging With Conditional Generative Adversarial Networks". arXiv preprint arXiv:1702.01983.
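A toy sketch of the identity-preservation idea: find the latent code z* whose generated output matches the input image under an identity-feature distance. Here G and the feature extractor F are stand-in linear maps (the paper uses trained networks and gradient-based search); with linear maps, z* reduces to a least-squares solve.

```python
import numpy as np

rng = np.random.default_rng(0)
G = rng.normal(size=(16, 4))      # stand-in generator: image = G @ z
F = rng.normal(size=(8, 16))      # stand-in identity-feature extractor
x = G @ rng.normal(size=4)        # input image, known to lie in G's range

# z* = argmin_z || F(G z) - F(x) ||^2, solved in closed form here.
z_star, *_ = np.linalg.lstsq(F @ G, F @ x, rcond=None)
err = np.linalg.norm(F @ (G @ z_star) - F @ x)
print(err)                        # ~0: identity features are preserved
```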
Face Aging with Conditional GANs

Figure 3 in the original paper.

Antipov, G., Baccouche, M., & Dugelay, J. L. (2017). "Face Aging With Conditional Generative Adversarial Networks". arXiv preprint arXiv:1702.01983.