
Assignment 7 & 8: Training MLP

In this assignment, you are asked to train two different MLPs and apply them to the XOR dataset created below. Additionally, you need to discuss your results. There are four tasks in total. Answer each question in the designated place and run all the code. For coding parts, fill in your answer at:

# Your code goes here
For text answers, fill in your answer at "Your answer goes here." Answer in plain text.

Consider the XOR dataset below:

import numpy as np

# Create an XOR dataset: a classic non-linearly-separable dataset.

# Generate data for class A: two Gaussian blobs centered at (-1,-1) and (1,1)
mean1 = [-1, -1]
cov1 = [[0.1, 0], [0, 0.1]]
class0_data1 = np.random.multivariate_normal(mean1, cov1, 100)
mean2 = [1, 1]
cov2 = [[0.1, 0], [0, 0.1]]
class0_data2 = np.random.multivariate_normal(mean2, cov2, 100)
class0_data = np.concatenate((class0_data1, class0_data2), axis=0)
class0_labels = np.zeros(200)

# Generate data for class B: two Gaussian blobs centered at (-1,1) and (1,-1)
mean3 = [-1, 1]
cov3 = [[0.1, 0], [0, 0.1]]
class1_data1 = np.random.multivariate_normal(mean3, cov3, 100)
mean4 = [1, -1]
cov4 = [[0.1, 0], [0, 0.1]]
class1_data2 = np.random.multivariate_normal(mean4, cov4, 100)
class1_data = np.concatenate((class1_data1, class1_data2), axis=0)
class1_labels = np.ones(200)

# Combine data and labels
X = np.concatenate((class0_data, class1_data), axis=0)
y = np.concatenate((class0_labels, class1_labels), axis=0)

Visualize the data:

import matplotlib.pyplot as plt
plt.scatter(class0_data[:, 0], class0_data[:, 1])
plt.scatter(class1_data[:, 0], class1_data[:, 1])

Convert the arrays to PyTorch tensors:

import torch
c1_tensor = torch.from_numpy(class0_data).float()
c2_tensor = torch.from_numpy(class1_data).float()

Task 1: Training an MLP model with a 10-dimensional hidden layer

Recall that the example shown in class can be described as follows. Given data x ∈ R^2, we do the following:
1. transform it into a hidden feature h ∈ R^3,
2. pass h through a sigmoid function,
3. transform the result to ĥ ∈ R,
4. convert ĥ to an output o ∈ [0, 1] via a sigmoid.

Now, in this task, you need to build an MLP model consisting of two linear layers which, given data x ∈ R^2, performs the following process:
1. transform it into a hidden feature h ∈ R^10,
2. pass h through a sigmoid function,
3. transform the result to ĥ ∈ R,
4. convert ĥ to an output o ∈ [0, 1] via a sigmoid.

import torch

class MultilayerPerceptron_Q1(torch.nn.Module):
    def __init__(self):
        super(MultilayerPerceptron_Q1, self).__init__()
        # Your code goes here

    def forward(self, x):
        # Your code goes here
        return None
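One way to fill in the blanks is sketched below, assuming the architecture described above: a Linear(2, 10) layer and a Linear(10, 1) layer, each followed by a sigmoid. The attribute names (layer1, layer2, sigmoid) are placeholders of my choosing, not names given in the assignment.

import torch

class MultilayerPerceptron_Q1(torch.nn.Module):
    def __init__(self):
        super(MultilayerPerceptron_Q1, self).__init__()
        self.layer1 = torch.nn.Linear(2, 10)   # x in R^2 -> h in R^10
        self.layer2 = torch.nn.Linear(10, 1)   # sigmoid(h) in R^10 -> h-hat in R
        self.sigmoid = torch.nn.Sigmoid()

    def forward(self, x):
        h = self.layer1(x)         # step 1: hidden feature h in R^10
        h = self.sigmoid(h)        # step 2: sigmoid nonlinearity
        h_hat = self.layer2(h)     # step 3: h-hat in R
        o = self.sigmoid(h_hat)    # step 4: output o in [0, 1]
        return o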
def negative_likelihood(model, katydid_tensor, grasshopper_tensor):
    o = model(katydid_tensor)  # model's confidence that a sample belongs to the katydid class
    likelihood = -torch.log(o + 0.000001)  # small epsilon avoids log(0)
    o = model(grasshopper_tensor)
    likelihood2 = -torch.log((1 - o) + 0.000001)
    overall_loss = likelihood.sum() + likelihood2.sum()
    return overall_loss

# Randomly hold out samples to evaluate the model (150 of the 200 per class)
indices_holdout_c1 = torch.randperm(len(c1_tensor))[:150]
indices_holdout_c2 = torch.randperm(len(c2_tensor))[:150]
c1_tensor_holdout = c1_tensor[indices_holdout_c1]
c2_tensor_holdout = c2_tensor[indices_holdout_c2]

# Remove the holdout samples from the training data (c1_tensor and c2_tensor)
vector_c1 = torch.zeros(200)
for i in indices_holdout_c1:
    vector_c1[i] = 1
c1_tensor_train = c1_tensor[vector_c1 != 1]
vector_c2 = torch.zeros(200)
for i in indices_holdout_c2:
    vector_c2[i] = 1
c2_tensor_train = c2_tensor[vector_c2 != 1]

# Gradient descent
import torch.optim as optim

model = MultilayerPerceptron_Q1()
op = optim.SGD(model.parameters(), lr=0.01)
loss_lst = []
loss_holdout_lst = []
n_epoch = 5000
for i in range(n_epoch):
    # Training
    loss = negative_likelihood(model, c1_tensor_train, c2_tensor_train)  # compute loss (training data)
    op.zero_grad()   # clear cached gradients
    loss.backward()  # compute gradients
    op.step()        # gradient descent step
    # Validation: check performance on data unseen by the model
    with torch.no_grad():
        loss_holdout = negative_likelihood(model, c1_tensor_holdout, c2_tensor_holdout)  # compute loss (validation/holdout data)
    loss_lst.append(loss.item() / 100)                   # average over the 100 training samples
    loss_holdout_lst.append(loss_holdout.item() / 300)   # average over the 300 holdout samples
print(loss)
print(loss_holdout)

Task 2: Write down your conclusions after visualizing the loss curves and the decision boundary below:

plt.plot(np.array(loss_lst[60:]))
plt.plot(np.array(loss_holdout_lst[60:]))

# Draw the decision boundary of the trained model
# Generate a grid of points for plotting the decision boundary
x_min, x_max = -3, 3
y_min, y_max = -3, 3
xx, yy = np.meshgrid(np.arange(x_min, x_max, 0.1), np.arange(y_min, y_max, 0.1))
# Create a tensor from the grid points
grid_tensor = torch.tensor(np.c_[xx.ravel(), yy.ravel()])
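The question's snippet ends after building grid_tensor. A minimal sketch of how the boundary plot could be finished is below, assuming the trained model outputs the probability of class 1 and that a 0.5 threshold defines the boundary; the cast to float32 and the names probs and Z are my assumptions, not part of the original.

with torch.no_grad():
    probs = model(grid_tensor.float())  # cast to float32 to match the model weights
Z = probs.numpy().reshape(xx.shape)     # back to the 2-D grid shape for contour plotting
plt.contourf(xx, yy, Z, levels=[0, 0.5, 1], alpha=0.3)  # shade the two predicted regions
plt.scatter(class0_data[:, 0], class0_data[:, 1])
plt.scatter(class1_data[:, 0], class1_data[:, 1])
plt.show()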
