Question: ( 4 points ) Derive d e l J d e l W i j . ( 2 points ) Write d e l J
points Derive
points Write as an outer product of two vectors. is a matrix with the same dimen
sions as ; it is just like a gradient, except that and are matrices rather than vectors.
points Derive Problem
You want to train a neural network to recognize the age and sex based on the face image. Your
training data consists of grayscale pixel images. The training labels include the age and sex
of each person. Your neural network consists of an input layer with units, a hidden
layer with units, and an output layer with units. You use the ReLU activation function for
the hidden units and no activation function for the outputs or inputs You train your network
with the cost function Please use the following notations to answer questions:
is a training image input vector with a component appended to the end as a bias is
a training label input vector, and is the output vector. All vectors are column vectors.
max is the ReLU activation function, is its derivative if other
wise and is applied componentwise to a vector.
is the vector of hidden unit values before the ReLU activation functions are applied, and
is the vector of hidden unit values after they are applied but we append a com
ponent to the end of as the bias term
is the weight matrix mapping the input layer to the hidden layer;
is the weight matrix mapping the hidden layer to the output layer;
Questions:
points Calculate the number of parameters weights in this network. You can leave your
answer as an expression.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
