1 Problem 1: Generative adversarial networks (50 points)
In this problem, suppose that we will implement a generative adversarial network (GAN) that models a high-dimensional data distribution $p_{\text{data}}(x)$, where $x \in \mathbb{R}^n$. To do so, we will define a generator $G: \mathbb{R}^k \to \mathbb{R}^n$; we obtain samples from our model by first sampling a $k$-dimensional random vector $z \sim \mathcal{N}(0, I)$ and then returning $G(z)$.
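As a concrete illustration, sampling from the model might look like the following minimal Python/NumPy sketch, where `generator` is a hypothetical callable standing in for a trained $G$:

```python
import numpy as np

def sample_from_gan(generator, k, num_samples=16):
    # Draw z ~ N(0, I): one k-dimensional standard normal vector per sample.
    z = np.random.randn(num_samples, k)
    # Push the latent vectors through the generator to obtain x = G(z) in R^n.
    return generator(z)
```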
We will also define a discriminator $D: \mathbb{R}^n \to (0, 1)$ that judges how realistic the generated images $G(z)$ are, compared to samples from the data distribution $x \sim p_{\text{data}}(x)$. Because its output is intended to be interpreted as a probability, the last layer of the discriminator is frequently the sigmoid function
$$\sigma(x) = \frac{1}{1 + e^{-x}}.$$
There are several common variants of the loss functions used to train a generative adversarial network (GAN). They can all be described as a procedure where we alternately perform a gradient descent step on $L_D(\phi; \theta)$ with respect to $\phi$ to train the discriminator $D$, and a gradient descent step on $L_G(\theta; \phi)$ with respect to $\theta$ to train the generator $G$:
$$\min_{\phi} L_D(\phi; \theta), \qquad \min_{\theta} L_G(\theta; \phi).$$
In our lecture, we talked about the following losses, where the discriminator's loss is given by
$$L_D(\phi; \theta) = -\mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] - \mathbb{E}_{z \sim \mathcal{N}(0, I)}[\log(1 - D(G(z)))]$$
and the generator's loss is given by the minimax loss:
$$L_G^{\text{minimax}}(\theta; \phi) = \mathbb{E}_{z \sim \mathcal{N}(0, I)}[\log(1 - D(G(z)))].$$
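To make the two objectives concrete, here is a minimal NumPy sketch that estimates $L_D$ and $L_G^{\text{minimax}}$ over a batch; `D` is a hypothetical discriminator returning probabilities in $(0, 1)$, and `x_real`, `x_fake` are batches of data samples and generated samples $G(z)$:

```python
import numpy as np

def discriminator_loss(D, x_real, x_fake):
    # L_D = -E[log D(x)] - E[log(1 - D(G(z)))], estimated as a batch average.
    return -np.mean(np.log(D(x_real))) - np.mean(np.log(1.0 - D(x_fake)))

def generator_minimax_loss(D, x_fake):
    # L_G^minimax = E[log(1 - D(G(z)))], estimated as a batch average.
    return np.mean(np.log(1.0 - D(x_fake)))
```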
(25 points) The minimax loss for $L_G$ suffers from the vanishing gradient problem. In terms of the discriminator's logits $h_\phi$, the minimax loss is
$$L_G^{\text{minimax}}(\theta; \phi) = \mathbb{E}_{z \sim \mathcal{N}(0, I)}\left[\log\left(1 - \sigma(h_\phi(G(z)))\right)\right].$$
Show that the derivative of $L_G^{\text{minimax}}$ with respect to $\theta$ is approximately $0$ if $D(G(z)) \approx 0$, or equivalently, if $h_\phi(G(z)) \ll 0$. You may use the fact that $\sigma'(x) = \sigma(x)(1 - \sigma(x))$. Why is this problematic for the training of the generator when the discriminator successfully identifies a fake sample $G(z)$?
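Before attempting the derivation, the claim can be sanity-checked numerically: a finite-difference sketch of $\frac{d}{dh} \log(1 - \sigma(h))$ shows the gradient signal collapsing as the logit becomes very negative. This only illustrates the phenomenon; it is not a substitute for the requested proof.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def minimax_term(h):
    # Integrand of L_G^minimax as a function of the discriminator logit h.
    return np.log(1.0 - sigmoid(h))

eps = 1e-6
for h in [0.0, -2.0, -5.0, -10.0]:
    # Central finite difference approximates d/dh log(1 - sigma(h)).
    grad = (minimax_term(h + eps) - minimax_term(h - eps)) / (2.0 * eps)
    print(f"h = {h:6.1f}   D = sigma(h) = {sigmoid(h):.5f}   grad = {grad:.6f}")
```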
(25 points) To solve this vanishing gradient problem, we usually replace $L_G^{\text{minimax}}$ with other loss functions, such as the non-saturating loss [1]
$$L_G^{\text{ns-gan}}(\theta; \phi) = -\mathbb{E}_{z \sim \mathcal{N}(0, I)}[\log D(G(z))];$$
further loss functions can be found in [2]. You may plot different loss functions, including the minimax loss and the non-saturating loss, to show the contrast. You also need to explain why the non-saturating loss avoids the vanishing gradient problem.
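As the problem statement suggests, plotting the two generator losses against the discriminator's logit makes the contrast visible. A minimal sketch, assuming matplotlib is available:

```python
import numpy as np
import matplotlib.pyplot as plt

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Sweep the discriminator's logit h = h_phi(G(z)).
h = np.linspace(-8, 8, 500)

# The minimax loss saturates (slope -> 0) as h -> -infinity, i.e. exactly
# when the discriminator confidently rejects the fake sample; the
# non-saturating loss keeps a slope close to -1 in that regime.
plt.plot(h, np.log(1 - sigmoid(h)), label=r"minimax: $\log(1 - \sigma(h))$")
plt.plot(h, -np.log(sigmoid(h)), label=r"non-saturating: $-\log \sigma(h)$")
plt.xlabel(r"logit $h_\phi(G(z))$")
plt.ylabel("generator loss")
plt.legend()
plt.show()
```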