Question: a ) . Consider the following neural network architecture with one hidden unit and one output unit as follows: [ 4 Marks ] Assume a
a Consider the following neural network architecture with one hidden unit and one output unit as follows:
Marks
Assume a sequence length of size with each xi represented by a dimensional vector. Let represent a single hidden layer with
hidden units and tanh activation. Model predicts y using softmax function with representing the weight matrix and representing the
bias for node
What are the dimensions of W U b h d and the input matrix
ii What are the total number of parameters in the above architecture?
b Two friends Mr Raju and Mr Robert, while experimenting with a deep neural network found that their model is overfitting the given
training data. The friends referred to the available literature and materials and came up with a solution to reduce the overfitting. Mr Raju
decided to add drop out along with batch normalization to his model. Whereas Mr Robert decided to go with a deeper model hence he
added more layers. But he found that his model is struggling to train because of his hardware limitations So he decided to employ early
stopping. Assess whether the decisions Mr Raju and Mr Ravi are taking helps them to proceed in the night direction.
Marks
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
