Question: Anaswer question P 1 . 3 , P 2 . 3 , P 3 . 1 and P 3 . 2 . ( Dont write

Anaswer question P

1.3,

2.3,

3.1

and P

3.2 . (

Dont write description, just code

) .

Also review these codes and find the isuse. Check the following code lines

(

DONT USE AI

),

against these questions and modify if any issue is there.

import numpy as np

import matplotlib.pyplot as plt

from tensorflow import keras

from tensorflow.keras import layers

from tensorflow.keras.datasets import mnist

from tensorflow.keras.models import Sequential

(

_

train, y

_

train

), (

_

test, y

_

test

) =

mnist.load

_

data

()

_

train

=

_

train.astype

("

float

32 ") / 255.0

_

test

=

_

test.astype

("

float

32 ") / 255.0

_

train

=

.

expand

_

dims

(

_

train,

- 1)

_

test

=

.

expand

_

dims

(

_

test,

- 1)

input

_

shape

=

_

train.shape

[1

]

num

_

classes

= 10

epochs

= 1000

def build

_

mlp

_

model

()

model

=

Sequential

([

layers.Flatten

(

input

_

shape

=

input

_

shape

),

layers.Dense

(4,

activation

=

"relu"

),

layers.Dense

(4,

activation

=

"relu"

),

layers.Dense

(4,

activation

=

"relu"

),

layers.Dense

(4,

activation

=

"relu"

),

layers.Dense

(4,

activation

=

"relu"

),

layers.Dense

(4,

activation

=

"relu"

),

layers.Dense

(

.

prod

(

input

_

shape

),

activation

=

"sigmoid"

),

layers.Reshape

(

input

_

shape

)

])

return model

mlp

_

model

=

build

_

mlp

_

model

()

mlp

_

model.compile

(

optimizer

= "

sgd

",

loss

=

"mse"

)

mlp

_

model.summary

()

mlp

_

history

=

mlp

_

model.fit

(

_

train

[

1],

_

train

[

1],

epochs

=

epochs, batch

_

size

= 1,

verbose

= 1)

plt

.

plot

(

mlp

_

history.history

['

loss

'])

plt

.

title

('

MLP Training Loss'

)

plt

.

xlabel

('

Epoch

')

plt

.

ylabel

('

Loss

')

plt

.

show

()

def build

_

cnn

_

model

()

model

=

Sequential

([

layers.Conv

2

(10,

kernel

_

size

= (5, 5),

activation

=

'relu', padding

=

'same', input

_

shape

=

input

_

shape

),

layers.Conv

2

(

input

_

shape

[- 1],

kernel

_

size

= (1, 1),

activation

=

'sigmoid', padding

=

'same'

)

])

return model

cnn

_

model

=

build

_

cnn

_

model

()

cnn

_

model.compile

(

optimizer

= "

sgd

",

loss

=

"mse"

)

cnn

_

model.summary

()

cnn

_

history

=

cnn

_

model.fit

(

_

train

[

1],

_

train

[

1],

epochs

=

epochs, batch

_

size

= 1,

verbose

= 1)

plt

.

plot

(

cnn

_

history.history

['

loss

'])

plt

.

title

('

CNN Training Loss'

)

plt

.

xlabel

('

Epoch

')

plt

.

ylabel

('

Loss

')

plt

.

show

()

1 =

.

eye

(

input

_

shape

[0])

2 =

.

eye

(4)

3 =

.

eye

(4)

1 =

.

zeros

((4,))

2 =

.

zeros

((4,))

3 =

.

zeros

(

input

_

shape

)

3 =

.

ones

(

input

_

shape

) *

_

train

[0] [0] [0] [0]

def build

_

identity

_

cnn

_

model

()

model

=

keras.Sequential

([

layers.Conv

2

(

input

_

shape

[- 1],

kernel

_

size

= (1, 1),

activation

=

'linear', padding

=

'same', input

_

shape

=

input

_

shape

)

])

return model

def build

_

identity

_

relu

_

cnn

_

model

()

model

=

keras.Sequential

([

layers.Conv

2

(

input

_

shape

[- 1],

kernel

_

size

= (1, 1),

activation

=

'relu', padding

=

'same', input

_

shape

=

input

_

shape

)

])

return model

Answer these questions:

1.1 -

Implement a fully connected neural network h:

[0, 1]^28

28 (

up to

) [0, 1]^28

28

model that regresses an image into itself. The architecture should have

7

trainable dense layers: the first

6

layers with

4

neurons and ReLU activation, and an output layer with the necessary number of units and activation.

1.2 -

Train the model using SGD on the appropriate loss function for

10^3

epochs on the training data. Plot the training loss over epochs.

1.3 -

Plot the prediction over the training set and test set

(

you should spot a pattern in the predictions, but since there is some randomness associated with using the GPU we recommend repeating the training

3 - 5

times to be sure you pick up the right pattern

) .

Which function do you conjecture h

(

)

has learnt

(

write it in formula

) ?

2.1 -

Implement a CNN g:

[0, 1]^28

28 (

up to

) [0, 1]^28

28

model that regresses an image into itself. The architecture should have

2

convolutional layers: the first with

10

filters, kernel size

5

5

and the same output size as input, and the second a convolutional output layer with the necessary number of filters, kernel and activation.

2.2 -

Train the model using SGD on the appropriate loss function for

10^3

epochs on the training data. Plot the training loss over ephocs.

2.3 - [

exaclty the same as P

1.3

but for g

(

)]

3.1 -

Consider a multilayer ReLU network h: R

^

(

up to

)

^

n such that h

(

) =

3

ReLU

(

2

ReLU

(

1

+

1) +

2) +

3

with W

1 (

as an element of

)

^

a x n

,

2 (

as an element of

)

^

n x a

,

3 (

as an element of

)

^

n x n

,

1 (

as element of

)

^

a; b

2,

3 (

as element of

)

^

.

Find a possible solution for W

1,

2,

3,

1,

2,

3

such that h represents the identity funct

What if you want h to represent a constant function that always outputs x

0 ?

3.2 -

Consider a CNN g: R

^

n x n

(

up to

)

^

n x n model composed by a first hidden convolutional layer with c filters, d x d

(

> 1

odd

)

kernel, identity activation and a suitable convolutional output layer. Find a possible architecture for g

(

.

.

specify the complete architecture, c

,

the values in the filters, padding and stride

)

such that g represents the identity function.

If instead of the identity activation, we use a ReLU activation, how should the architecture change?

(

Note: R for set of real

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Portfolio betas and tax considerations [10 points] For this question, we ignore all factors other than the market (i.e., we ignore SMB, HML, momentum, ...). You have constructed 3 portfolios Q, S,...

For this lab you will fill in the skeleton of a Java program that will run a Dragon Trainers. The rules of the game are simple: each player has three dragons that they have trained, and each dragon...

/** * YOUR DESCRIPTION OF THIS PROGRAM HERE * @author YOUR NAME HERE * @version DATE HERE */ import java.util.Random; import java.util.Scanner; public class DragonTrainers { /** * Constant array to...

Computer Organization and Networks Practicals 2021/22 October 9, 2021 Computer Organization and Networks Practicals 2021/22 b68495714b Contents Contents 0 Introduction 3 0.1 Registration . . . . . ....

c++ Overview In this assignment, you will simulate a simple board game. The board is a grid, and starts with a pile of money in each cell. Players take turns rolling four dice to pick a cell, and...

The Assignment Write a program that reads, from the standard input, a description of a weighted graph with integer weights, in the input format shown below. Then the program should write, on the...

Write a program that reads, from the standard input, a description of a weighted graph with integer weights, in the input format shown below. Then the program should write, on the standard output:...

Briefly explain these terms: a. Basic variable. b. Shadow price. c. Range of feasibility. d. Range of optimality.

In Liinear-time selection algorithm where the input array of numbers are cut into groups of size 5. Show that, when the group size is 7, the algorithm still runs in linear time.

The exchange rate of Thai Baht as of April 1 st , 2 0 2 5 is $ 3 4 . 1 9 Thai Baht, which is equivalent to $ 1 . 0 0 US dollar. In this section you forecast exchange rate. Use the spot exchange rate...

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

Why do HCMSs exist? Do they change over time?

Suppose the price of oil falls sharply (as it did in 1986 and again in 1998). a. Show the impact of such a change in both the aggregate-demand/aggregate-supply diagram and in the Phillips-curve...

When did the shift from Text-based Business Application Software to GUI-based Applications begin?