Q2. Gradient Descent (2 pt)
Given $N$ training data points $\{(x_k, y_k)\}$, $k = 1, 2, \dots, N$, with $x_k \in \mathbb{R}^d$ and labels $y_k \in \{-1, 1\}$ (either $-1$ or $1$), we seek a linear discriminant function $f(x_k) = w^\top x_k = \sum_{j=1}^{d} w_j x_{k,j}$ (where $x_{k,j}$ is the feature value of attribute $j$ of a data point $x_k$) optimizing a special loss function $L(z) = e^{-z}$, where $z = y f(x)$.
Let $\eta > 0$ be the learning rate. Please derive the gradient update $\Delta w_k$ for a randomly selected data point $k$ in the stochastic gradient descent (SGD) method.
Hint: Note that SGD randomly picks one data sample $k$ for a gradient update per iteration. We can write $z_k = y_k f(x_k) = y_k \left( \sum_{j=1}^{d} w_j x_{k,j} \right)$, where $x_{k,j}$ is the feature value of attribute $j$ of a data point $x_k$. You need to first write $\Delta w_j$ (the update from the partial derivative with respect to attribute $j$) and then get $\Delta w_k$ (the vector consisting of the updates with respect to the $d$ attributes).
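
A sketch of the standard derivation for this loss, following the hint's two steps and the notation above: for the selected sample $k$, the per-sample loss is $L(z_k) = e^{-z_k}$ with $z_k = y_k \sum_{j=1}^{d} w_j x_{k,j}$. By the chain rule,

\[
\frac{\partial L}{\partial w_j} = \frac{dL}{dz_k} \cdot \frac{\partial z_k}{\partial w_j} = \left(-e^{-z_k}\right) \left(y_k x_{k,j}\right) = -e^{-z_k}\, y_k x_{k,j}.
\]

SGD steps against the gradient, so per attribute $\Delta w_j = -\eta \frac{\partial L}{\partial w_j} = \eta\, e^{-z_k} y_k x_{k,j}$, and stacking the $d$ components gives the vector update

\[
\Delta w_k = \eta\, e^{-y_k w^\top x_k}\, y_k x_k, \qquad w \leftarrow w + \Delta w_k.
\]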
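
The same update can be checked numerically. Below is a minimal NumPy sketch of one SGD step for this exponential loss; the function name sgd_update and the toy data are illustrative assumptions, not part of the original problem.

import numpy as np

def sgd_update(w, x_k, y_k, eta):
    # Margin of the selected sample: z_k = y_k * w^T x_k
    z_k = y_k * np.dot(w, x_k)
    # Gradient of L(z) = exp(-z) w.r.t. w: dL/dw = -exp(-z_k) * y_k * x_k
    grad = -np.exp(-z_k) * y_k * x_k
    # Descent step: w <- w - eta * grad, i.e. w + eta * exp(-z_k) * y_k * x_k
    return w - eta * grad

# Toy usage (hypothetical data): 100 random points in R^5 with +/-1 labels
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = np.where(rng.normal(size=100) > 0, 1.0, -1.0)
w = np.zeros(5)
for _ in range(1000):
    k = rng.integers(len(X))                # SGD: pick one sample at random
    w = sgd_update(w, X[k], y[k], eta=0.1)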