PLEASE DO NOT USE AI! (I'll know if you do)
Consider the following loss function on $w \in \mathbb{R}^4$:
$L(w) = w_1^2 + w_2^2 + w_3^2 + 2 w_1 w_2 + w_4^2 + 4 w_1 + 4 w_2 + 4$
(a) What is $\nabla L(w)$? What is the minimum value of $L(w)$?
(b) Suppose we run gradient descent on $L$:
$w_t = w_{t-1} - \eta \nabla L(w_{t-1})$ for all $t \geq 1$
If the step size is $\eta > 0$ and $w_0 = (0,0,1,1)$, what is $w_t$ after $t$ steps?
(c) Suppose we use gradient descent on $L$ with ridge regularization:
$\tilde{L}(w) = L(w) + \lambda \|w\|^2$
If $\eta > 0$, $\lambda \geq 0$, and $w_0 = (0,0,1,1)$, what is $w_t$ after $t$ steps?
(d) Implement gradient descent on $\tilde{L}$ for the following nine combinations: $\eta = 1, 0.1, 0.01$
and $\lambda = 0, 1, 10$. Include your code and plot $L(w_t)$, where the x-axis is the number of steps
and $w_0 = (0,0,1,1)$.
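
One possible starting point for part (d) is the minimal pure-Python sketch below. It is not a definitive solution. The names `eta` (step size), `lam` (ridge weight), `loss`, `grad`, and `gradient_descent` are all illustrative choices, and the regularizer is assumed to be `lam * ||w||^2` (squared Euclidean norm, no factor of 1/2). The sketch records the unregularized loss at each step; if the regularized objective is what should be plotted, add `lam * sum(x * x for x in w)` to the recorded value.

```python
def loss(w):
    """Unregularized loss L(w) exactly as written in the question."""
    w1, w2, w3, w4 = w
    return (w1 * w1 + w2 * w2 + w3 * w3 + 2 * w1 * w2
            + w4 * w4 + 4 * w1 + 4 * w2 + 4)


def grad(w, lam=0.0):
    """Gradient of L(w) + lam * ||w||^2 (assumed form of the ridge term)."""
    w1, w2, w3, w4 = w
    shared = 2 * w1 + 2 * w2 + 4          # dL/dw1 and dL/dw2 coincide
    base = [shared, shared, 2 * w3, 2 * w4]
    return [g + 2 * lam * x for g, x in zip(base, w)]


def gradient_descent(w0, eta, lam=0.0, steps=50):
    """Run `steps` updates w <- w - eta * grad; return final w and L(w_t) history."""
    w = list(w0)
    history = [loss(w)]                   # history[t] = L(w_t), t = 0..steps
    for _ in range(steps):
        g = grad(w, lam)
        w = [x - eta * gx for x, gx in zip(w, g)]
        history.append(loss(w))
    return w, history


if __name__ == "__main__":
    w0 = (0.0, 0.0, 1.0, 1.0)
    for eta in (1, 0.1, 0.01):
        for lam in (0, 1, 10):
            w, hist = gradient_descent(w0, eta, lam)
            print(f"eta={eta}, lam={lam}: L(w_50) = {hist[-1]:.6g}")
```

Each `history` list can be passed to any plotting library (for example `plt.plot(history)` with matplotlib) with the step index on the x-axis. Some combinations with `eta = 1` grow very quickly, so a logarithmic y-axis may be easier to read.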