In this project, we will be developing a basic neural network from the ground up to classify various types of fashion items. The primary objective of this project is to gain a comprehensive understanding of neural network architecture, including its theory and implementation details.

# initializing
# Notice that you don't need any other packages for this mid-term
import numpy as np
import pandas as pd
import random
from matplotlib import pyplot as plt
random.seed(42)  # NEVER change this line

# Reading the dataset
data = pd.read_csv('./fashion_data.csv')

# The data pre-processing is done for you. Please do NOT edit the cell
# However, you should understand what these codes are doing
data = np.array(data)
m, n = data.shape
np.random.shuffle(data)  # shuffle before splitting into dev and training sets

data_dev = data[0:400].T
Y_dev = data_dev[-1]
X_dev = data_dev[0:n-1]
X_dev = X_dev / 255.

data_train = data[400:m].T
Y_train = data_train[-1]
X_train = data_train[0:n-1]
X_train = X_train / 255.
_, m_train = X_train.shape
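To see what the pre-processing cell produces, the sketch below runs the same slicing on a synthetic array instead of the real CSV. The row count (1000) and the 784 pixel columns are assumptions chosen for illustration; the only facts taken from the cell are that the label sits in the last column, the first 400 shuffled rows form the dev set, and pixels are divided by 255.

```python
import numpy as np

# Synthetic stand-in for the CSV (assumption): 1000 rows, 784 pixel columns + 1 label column.
rng = np.random.default_rng(0)
pixels = rng.integers(0, 256, size=(1000, 784))
labels = rng.integers(0, 10, size=(1000, 1))
data = np.hstack([pixels, labels])

m, n = data.shape
np.random.shuffle(data)             # shuffles rows in place

data_dev = data[0:400].T            # transpose: each column is now one example
Y_dev = data_dev[-1]                # last row holds the labels
X_dev = data_dev[0:n-1] / 255.      # remaining rows are pixels, scaled to [0, 1]

data_train = data[400:m].T
Y_train = data_train[-1]
X_train = data_train[0:n-1] / 255.

print(X_dev.shape, Y_dev.shape)      # (784, 400) (400,)
print(X_train.shape, Y_train.shape)  # (784, 600) (600,)
```

After the transpose, every column of `X_dev` and `X_train` is one image, which is why the later functions treat `X` as a (features, examples) matrix.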

PART 1: building NN

# initializing parameters
# Initialize the parameters in the neural network
# Based on the figure above, we need the weight and bias matrices.
# W1, b1 are the matrices for the first layer
# W2, b2 are the matrices for the second layer
# You should think about the sizes of the matrices
# then initialize elements in the matrix to be random numbers between -0.5 and +0.5
def init_params():
    W1 = # Your code here
    b1 = # Your code here
    W2 = # Your code here
    b2 = # Your code here
    return W1, b1, W2, b2
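One possible way to fill in `init_params` is sketched here. The layer sizes are assumptions, because the figure the assignment refers to is not reproduced on this page: 784 inputs (matching the 28 x 28 reshape used later), a hypothetical hidden layer of 10 units, and 10 output classes. The default arguments `n_in`, `n_hidden`, and `n_out` are illustrative names, not part of the template.

```python
import numpy as np

def init_params(n_in=784, n_hidden=10, n_out=10):
    # Layer sizes are assumptions; adjust them to match the assignment's figure.
    # np.random.rand draws from [0, 1), so subtracting 0.5 gives values in [-0.5, 0.5).
    W1 = np.random.rand(n_hidden, n_in) - 0.5
    b1 = np.random.rand(n_hidden, 1) - 0.5
    W2 = np.random.rand(n_out, n_hidden) - 0.5
    b2 = np.random.rand(n_out, 1) - 0.5
    return W1, b1, W2, b2

W1, b1, W2, b2 = init_params()
print(W1.shape, b1.shape, W2.shape, b2.shape)
```

The biases are stored as column vectors so that they broadcast across all examples when added to a (units, examples) matrix.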

# As a starting point, you only need a ReLU function, its derivative, and the softmax function
def ReLU(Z):
    # Your code here

def ReLU_deriv(Z):
    # Your code here

def softmax(Z):
    # Your code here
    return A
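A minimal sketch of the three functions follows, assuming `Z` is a (units, examples) array so that softmax normalizes each column. Subtracting the column maximum before exponentiating is a common numerical-stability step added here; it does not change the result.

```python
import numpy as np

def ReLU(Z):
    # Element-wise max(Z, 0)
    return np.maximum(Z, 0)

def ReLU_deriv(Z):
    # Slope is 1 where Z > 0 and 0 elsewhere (the value at exactly 0 is a convention)
    return (Z > 0).astype(float)

def softmax(Z):
    # Shift each column by its maximum so np.exp cannot overflow
    Z_shift = Z - Z.max(axis=0, keepdims=True)
    expZ = np.exp(Z_shift)
    A = expZ / expZ.sum(axis=0, keepdims=True)
    return A

scores = np.array([[1.0, 1000.0], [2.0, 1000.0], [3.0, 1000.0]])
print(softmax(scores).sum(axis=0))  # each column sums to 1
```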

# In the forward propagation function, X is the input (the images in vector form), and we pass all the weights and biases
def forward_prop(W1, b1, W2, b2, X):
    Z1 = # Your code here
    A1 = # Your code here
    Z2 = # Your code here
    A2 = # Your code here
    return Z1, A1, Z2, A2
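The forward pass can be sketched as two affine steps, each followed by an activation. This assumes `X` has shape (features, examples), as produced by the pre-processing cell, and that the hidden layer uses ReLU and the output layer uses softmax, as the template suggests. The helper definitions are repeated only to keep the block runnable on its own.

```python
import numpy as np

def ReLU(Z):
    return np.maximum(Z, 0)

def softmax(Z):
    expZ = np.exp(Z - Z.max(axis=0, keepdims=True))
    return expZ / expZ.sum(axis=0, keepdims=True)

def forward_prop(W1, b1, W2, b2, X):
    Z1 = W1 @ X + b1      # hidden pre-activation, shape (hidden, examples)
    A1 = ReLU(Z1)         # hidden activation
    Z2 = W2 @ A1 + b2     # output pre-activation, shape (classes, examples)
    A2 = softmax(Z2)      # class probabilities, one column per example
    return Z1, A1, Z2, A2

# Tiny usage example with made-up sizes: 8 features, 5 hidden units, 4 classes, 3 examples
rng = np.random.default_rng(0)
demo = forward_prop(rng.random((5, 8)) - 0.5, rng.random((5, 1)) - 0.5,
                    rng.random((4, 5)) - 0.5, rng.random((4, 1)) - 0.5,
                    rng.random((8, 3)))
print(demo[3].shape)  # (4, 3)
```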

# backward propagation
# This one-hot function converts a numeric label into a one-hot vector
def one_hot(Y):
    # Your code here
    return one_hot_Y
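A possible `one_hot` is sketched here. It assumes the labels are integers starting at 0 and that there are 10 classes; the `num_classes` parameter is a hypothetical addition so the class count does not depend on which labels happen to appear in `Y`.

```python
import numpy as np

def one_hot(Y, num_classes=10):
    # num_classes=10 is an assumption about the dataset
    Y = np.asarray(Y).astype(int)
    one_hot_Y = np.zeros((num_classes, Y.size))
    # Row = class index, column = example index
    one_hot_Y[Y, np.arange(Y.size)] = 1
    return one_hot_Y

print(one_hot(np.array([0, 2]), num_classes=3))
# [[1. 0.]
#  [0. 0.]
#  [0. 1.]]
```

The result has one column per example, matching the layout of the softmax output `A2`.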

# Now perform the backward propagation
# Each step is only one line, but there is a lot of calculus behind it
def backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y):
    one_hot_Y = one_hot(Y)
    dZ2 = # Your code here
    dW2 = # Your code here
    db2 = # Your code here
    dZ1 = # Your code here
    dW1 = # Your code here
    db1 = # Your code here
    return dW1, db1, dW2, db2
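The six gradient lines can be sketched as follows, under the assumption that the loss is cross-entropy averaged over the `m` examples in the batch (the assignment text does not name the loss). With softmax outputs, that choice makes the output-layer error simply `A2 - one_hot_Y`. The helpers are repeated so the block runs on its own, and `num_classes=10` is an assumption.

```python
import numpy as np

def ReLU_deriv(Z):
    return (Z > 0).astype(float)

def one_hot(Y, num_classes=10):
    Y = np.asarray(Y).astype(int)
    one_hot_Y = np.zeros((num_classes, Y.size))
    one_hot_Y[Y, np.arange(Y.size)] = 1
    return one_hot_Y

def backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y):
    m = Y.size
    one_hot_Y = one_hot(Y)
    dZ2 = A2 - one_hot_Y                       # softmax + cross-entropy error
    dW2 = dZ2 @ A1.T / m                       # average over examples
    db2 = dZ2.sum(axis=1, keepdims=True) / m   # keep the (classes, 1) column shape
    dZ1 = (W2.T @ dZ2) * ReLU_deriv(Z1)        # chain rule back through ReLU
    dW1 = dZ1 @ X.T / m
    db1 = dZ1.sum(axis=1, keepdims=True) / m
    return dW1, db1, dW2, db2
```

Each gradient has the same shape as the parameter it updates, which is a quick sanity check when debugging.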

# Finally, we are ready to update the parameters
def update_params(W1, b1, W2, b2, dW1, db1, dW2, db2, alpha):
    W1 = # Your code here
    b1 = # Your code here
    W2 = # Your code here
    b2 = # Your code here
    return W1, b1, W2, b2
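For plain gradient descent, the update is a single step against each gradient, scaled by the learning rate `alpha`. A minimal sketch, assuming no momentum or regularization:

```python
import numpy as np

def update_params(W1, b1, W2, b2, dW1, db1, dW2, db2, alpha):
    # Move each parameter a small step in the direction that lowers the loss
    W1 = W1 - alpha * dW1
    b1 = b1 - alpha * db1
    W2 = W2 - alpha * dW2
    b2 = b2 - alpha * db2
    return W1, b1, W2, b2

# Usage with scalar-like arrays: 1.0 - 0.1 * 0.5 = 0.95
print(update_params(np.array([1.0]), np.array([1.0]), np.array([1.0]), np.array([1.0]),
                    np.array([0.5]), np.array([0.5]), np.array([0.5]), np.array([0.5]),
                    alpha=0.1)[0])
```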

# gradient descent
# Implement the helper functions. We need to convert the softmax output into a numeric label
# This is done through the get_predictions function
def get_predictions(A2):
    # Your code here

# We also want to have a simple function to compute the accuracy. Notice that "predictions" and "Y" are the same shape
def get_accuracy(predictions, Y):
    return # Your code here
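Both helpers can be sketched in one line each, assuming `A2` has shape (classes, examples) so that the predicted label is the row index of the largest probability in each column:

```python
import numpy as np

def get_predictions(A2):
    # Index of the most probable class for each example (column)
    return np.argmax(A2, axis=0)

def get_accuracy(predictions, Y):
    # Fraction of examples whose predicted label matches the true label
    return np.sum(predictions == Y) / Y.size

A2_demo = np.array([[0.9, 0.2],
                    [0.1, 0.8]])
preds = get_predictions(A2_demo)
print(preds)                                   # [0 1]
print(get_accuracy(preds, np.array([0, 0])))   # 0.5
```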

# Finally, we are ready to implement gradient descent
def gradient_descent(X, Y, alpha, iterations):
    W1, b1, W2, b2 = # Your code here - using the function you have implemented
    for i in range(iterations):
        Z1, A1, Z2, A2 = # Your code here - using the function you have implemented
        dW1, db1, dW2, db2 = # Your code here - using the function you have implemented
        W1, b1, W2, b2 = # Your code here - using the function you have implemented
        if i % 10 == 0:
            print("Iteration: ", i)
            predictions = get_predictions(A2)
            print(get_accuracy(predictions, Y))
    return W1, b1, W2, b2
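Putting the pieces together, the training loop might look like the condensed sketch below. It is not the assignment's reference solution: the helpers are shortened versions of the template functions, the loss is assumed to be averaged cross-entropy, and the keyword arguments `n_hidden` and `n_out` are hypothetical additions so the demo can run on small synthetic data instead of the real CSV.

```python
import numpy as np

def init_params(n_in, n_hidden, n_out):
    return (np.random.rand(n_hidden, n_in) - 0.5, np.random.rand(n_hidden, 1) - 0.5,
            np.random.rand(n_out, n_hidden) - 0.5, np.random.rand(n_out, 1) - 0.5)

def softmax(Z):
    expZ = np.exp(Z - Z.max(axis=0, keepdims=True))
    return expZ / expZ.sum(axis=0, keepdims=True)

def forward_prop(W1, b1, W2, b2, X):
    Z1 = W1 @ X + b1
    A1 = np.maximum(Z1, 0)                    # ReLU
    Z2 = W2 @ A1 + b2
    return Z1, A1, Z2, softmax(Z2)

def backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y):
    m = Y.size
    one_hot_Y = np.zeros_like(A2)             # class count inferred from A2
    one_hot_Y[Y.astype(int), np.arange(m)] = 1
    dZ2 = A2 - one_hot_Y
    dW2 = dZ2 @ A1.T / m
    db2 = dZ2.sum(axis=1, keepdims=True) / m
    dZ1 = (W2.T @ dZ2) * (Z1 > 0)             # ReLU derivative
    dW1 = dZ1 @ X.T / m
    db1 = dZ1.sum(axis=1, keepdims=True) / m
    return dW1, db1, dW2, db2

def gradient_descent(X, Y, alpha, iterations, n_hidden=10, n_out=10):
    W1, b1, W2, b2 = init_params(X.shape[0], n_hidden, n_out)
    for i in range(iterations):
        Z1, A1, Z2, A2 = forward_prop(W1, b1, W2, b2, X)
        dW1, db1, dW2, db2 = backward_prop(Z1, A1, Z2, A2, W1, W2, X, Y)
        W1, b1 = W1 - alpha * dW1, b1 - alpha * db1
        W2, b2 = W2 - alpha * dW2, b2 - alpha * db2
        if i % 10 == 0:
            predictions = np.argmax(A2, axis=0)
            print("Iteration: ", i)
            print(np.sum(predictions == Y) / Y.size)
    return W1, b1, W2, b2

# Demo on synthetic data (random labels, so accuracy stays near chance)
rng = np.random.default_rng(0)
X_demo = rng.random((784, 200))
Y_demo = rng.integers(0, 10, size=200)
W1, b1, W2, b2 = gradient_descent(X_demo, Y_demo, alpha=0.10, iterations=20)
```

With the real data, the call would presumably be `gradient_descent(X_train, Y_train, alpha, iterations)`; the values of `alpha` and `iterations` are left to the reader, since the template does not specify them.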

# validation set
def make_predictions(X, W1, b1, W2, b2):
    _, _, _, A2 = forward_prop(W1, b1, W2, b2, X)
    predictions = get_predictions(A2)
    return predictions

dev_predictions = make_predictions(X_dev, W1, b1, W2, b2)
get_accuracy(dev_predictions, Y_dev)

# exploring some samples
def test_prediction(index, W1, b1, W2, b2):
    current_image = X_train[:, index, None]
    prediction = make_predictions(X_train[:, index, None], W1, b1, W2, b2)
    label = Y_train[index]
    print("Prediction: ", prediction)
    print("Label: ", label)

    current_image = current_image.reshape((28, 28)) * 255
    plt.gray()
    plt.imshow(current_image, interpolation='nearest')
    plt.show()

test_prediction(0, W1, b1, W2, b2)
test_prediction(1, W1, b1, W2, b2)

Part 2: Error Analysis and Performance Improvements

You will now try to improve the model's performance through, for example, different activation functions, learning rate changes, expanding the network complexity, regularization, and dropout. Note: solve Part 2 in detail, with reasons why the model did or did not improve.
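As a starting point for these experiments, the sketch below shows three of the listed ideas as small drop-in helpers: a leaky ReLU as an alternative activation, inverted dropout on the hidden activations, and an L2-regularized weight update. All function names and default values (`slope`, `keep_prob`, `lam`) are hypothetical choices for illustration; whether any of them helps on this dataset has to be checked by comparing training and dev accuracy.

```python
import numpy as np

def leaky_ReLU(Z, slope=0.01):
    # Keeps a small gradient for negative inputs instead of zeroing them
    return np.where(Z > 0, Z, slope * Z)

def leaky_ReLU_deriv(Z, slope=0.01):
    return np.where(Z > 0, 1.0, slope)

def dropout(A, keep_prob, rng):
    # Inverted dropout: zero out units at random and rescale the survivors
    # so the expected activation is unchanged. Use only during training.
    mask = (rng.random(A.shape) < keep_prob) / keep_prob
    return A * mask, mask

def update_with_l2(W, dW, alpha, lam, m):
    # L2 regularization adds (lam / m) * W to the gradient of the weights
    return W - alpha * (dW + (lam / m) * W)

rng = np.random.default_rng(0)
A1_demo = np.ones((4, 3))
A1_dropped, mask = dropout(A1_demo, keep_prob=0.8, rng=rng)
print(A1_dropped)
```

When dropout is used, the same `mask` should multiply the corresponding `dZ1` term in the backward pass, and the dev-set predictions should be made without dropout. A gap between high training accuracy and lower dev accuracy suggests overfitting, which is the case regularization and dropout are meant to address; if both accuracies are low, a larger network or a different learning rate is the more likely fix.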
