Question: Question 3 . Consider a neural network model with one hidden layer and the following properties: The size of the input is 2 . The

Question 3. Consider a neural network model with one hidden layer and the following
properties:
The size of the input is 2.
The size of hidden layer is 3 with each node has activation function Sigmoid:
S(x)=11+e-x.
The output has size 1 with activation function tanh(x).
(a) Draw a computational graph for this neural network.
(Please write the answers on a piece of paper so that I can see the true graph and derivations in part (b). Thank you in advance!)
(b) The binary cross-entropy loss function for a label vector y and the prediction vector
hat(y) is defined as
L(y,hat(y))=-1li=1l[yi*log2(hat(y)i)+(1-yi)*log2(1-hat(y)i)]
Note that hat(y)i is the output of the neural network for input xi.
Calculate the gradient of L with respect to the weights of the first layer W(1). You
can use a matrix notation.
 Question 3. Consider a neural network model with one hidden layer

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!