Question: Modify your 'gradient descent assignment I' code in order to train the single hidden layer feedforward neural network (FFNN) discussed in class.

You must build your training set using the same code from 'gradient descent assignment I'.
Instead of initializing a single weight matrix to random values, you will now need to
initialize W1 and W2 to random values.
You must choose the number of neurons m (a good starting point would be m = 2n); see the
initialization sketch after the list below.
a. Upload your working Python script file
b. Make sure to output your training matrices x and y, or you will not receive credit for
this assignment.
c. Upload an image of your mserror plot in .png or .jpg format
d. Make sure to output your weight matrices W1 and W2.
e. Upload all your materials (code, output, and plots) in a single file in either PDF or DOCX
format.
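
As a reference for the shapes involved, here is a minimal initialization sketch, assuming a bias
row of ones is appended at each layer (match the bias convention and notation used in class):

import numpy as np

n, k = 3, 2                              # input/output dimensions from the starter code below
m = 2 * n                                # suggested hidden-layer width
W1 = np.random.normal(size=(m, n + 1))   # input (+ bias) -> hidden weights
W2 = np.random.normal(size=(k, m + 1))   # hidden (+ bias) -> output weights

The extra column in each matrix holds the bias weights for that layer.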
import numpy as np
import matplotlib.pyplot as plt
def linearnn(x, y, nits, eta):
    # initialize dimensions
    xshape = x.shape
    yshape = y.shape
    n = xshape[0]
    N = xshape[1]
    k = yshape[0]
    # build the input data matrix (append a row of ones for the bias term)
    z = np.ones((1, N))
    a = np.concatenate((x, z), axis=0)
    # initialize random weights
    w = np.random.normal(size=(k, n + 1))
    # initialize mserror storage
    mserror = np.zeros((1, nits))
    # train the network using gradient descent
    for itcount in range(nits):
        """
        Section of code to update network weights
        ...
        3. Compute the gradient of the mean squared error and store it in a
           variable named 'gradientofmse'
        """
        gradientofmse = np.dot(np.dot(w, a) - y, a.T)
        """
        4. Given the variable 'gradientofmse', update the network weight
           matrix w using the gradient descent formula given in class
        """
        w = w - eta * gradientofmse
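        # note: this expression omits the 1/N factor from the MSE gradient;
        # that constant scale is simply absorbed into the learning rate eta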
        # calculate MSE
        etemp = 0
        for trcount in range(N):
            # get the (f(x_k) - y_k) column vector
            avec = np.concatenate((x[:, trcount], [1]), axis=0)
            vk = np.dot(w, avec) - y[:, trcount]
            # accumulate the squared error over all k output components
            etemp = etemp + np.dot(vk, vk)
        mserror[0, itcount] = (1 / (2 * N)) * etemp
    # test the output
    netout = np.dot(w, a)
    return netout, mserror, w
# 'main' starts here
# set up the training set
n = 3
k = 2
N = 5
x = np.random.normal(size=(n, N))
y = np.random.normal(size=(k, N))
# compute the closed form solution
z = np.ones((1, N))
a = np.concatenate((x, z), axis=0)
wpinv = np.dot(y, np.linalg.pinv(a))
print('Weight matrix computed using the pseudoinverse:')
print(wpinv)
print()
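# wpinv = y * pinv(a) is the closed-form least-squares solution, so gradient
# descent on the same MSE should converge toward it; the difference printed
# below acts as a sanity check on the training loop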
"""
Section of code to call the gradient descent function to computer the weight matrix
1. You must input a value for the number of iterations, nits
2. You must input a value for the learning rate, eta (You should run numerical tests with a few different values)
"""
nits =1000 # Set the number of iterations
eta =0.01 # Set the learning rate
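# (reasonable values to test: eta in {0.001, 0.01, 0.1}; if eta is too large
# the MSE will grow instead of decreasing)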
netout, mserror, w = linearnn(x, y, nits, eta)
print('Weight matrix computed via gradient descent:')
print(w)
print()
"""
Compute the sum of the absolute value of the difference between the weight matrices from pinv and gradient descent
"""
w_sum_abs_diff = np.sum(np.absolute(wpinv - w))
print('Sum of absolute value of the difference between weight matrices:')
print(w_sum_abs_diff)
# plot the mean squared error
plt.plot(mserror[0, :])
plt.title('Mean Squared Error versus Iteration')
plt.xlabel('Iteration #')
plt.ylabel('MSE')
plt.savefig('plot.png')
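
For the FFNN version of the assignment, the single weight matrix w is replaced by W1 and W2
with a nonlinearity between them, and the gradient is computed layer by layer. Below is a
minimal sketch of that modification, assuming a tanh hidden activation and bias rows at both
layers; the function name ffnn is a placeholder, and the activation should be swapped for
whatever was discussed in class:

import numpy as np

def ffnn(x, y, m, nits, eta):
    # dimensions
    n, N = x.shape
    k = y.shape[0]
    # input data matrix with bias row
    a = np.concatenate((x, np.ones((1, N))), axis=0)
    # initialize both weight matrices to random values
    W1 = np.random.normal(size=(m, n + 1))   # input (+ bias) -> hidden
    W2 = np.random.normal(size=(k, m + 1))   # hidden (+ bias) -> output
    mserror = np.zeros((1, nits))
    for itcount in range(nits):
        # forward pass
        h = np.tanh(np.dot(W1, a))                          # hidden activations, (m, N)
        hb = np.concatenate((h, np.ones((1, N))), axis=0)   # append hidden bias row
        f = np.dot(W2, hb)                                  # network output, (k, N)
        err = f - y
        mserror[0, itcount] = np.sum(err * err) / (2 * N)
        # backward pass: gradients of the MSE with respect to W2 and W1
        gW2 = np.dot(err, hb.T) / N
        delta = np.dot(W2[:, :m].T, err) * (1 - h * h)      # tanh'(z) = 1 - tanh(z)**2
        gW1 = np.dot(delta, a.T) / N
        # gradient descent updates
        W2 = W2 - eta * gW2
        W1 = W1 - eta * gW1
    # test the output with the trained weights
    hb = np.concatenate((np.tanh(np.dot(W1, a)), np.ones((1, N))), axis=0)
    netout = np.dot(W2, hb)
    return netout, mserror, W1, W2

With the training set built exactly as above, a call such as
netout, mserror, W1, W2 = ffnn(x, y, 2 * n, nits, eta) follows the m = 2n suggestion; printing
x, y, W1, and W2 and saving the mserror plot as in the linear script covers parts (b)-(d).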