Question: 3 . Neural networks ( 1 0 points ) . Consider a simple two - layer network in the lecture slides. Given n training data

3. Neural networks (10 points). Consider a simple two-layer network in the lecture slides. Given n training data (x, y'), i =1,..., n, the cost function used to training the neural networks l(w,a,B)=(y (WTz:))? a = i=1= where o(x)=1/(1+e-2) is the sigmoid function, zi is a two-dimensional vector such that = g(a+c), and == g(812).(a)(5 points) Show the that the gradient is given by n al(w,a,)w =2(x g(x))(u)(1 F(x))=, i=1= where ui = wTzi. (b)(5 points) Also show the gradient of l(w,a,) with respect to a and B and write down their expression.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!