[Linear Regression](8 pts)
Suppose we have a weight vector $w = [w_1, w_2]^T$, input vectors $x_n \in \mathbb{R}^2$, and targets $y_n \in \{0, 1\}$, where the prediction is $\hat{y} = w^T x = w_1 x_1 + w_2 x_2$. Let us initialize all the weights to 0. Also, suppose we have $N = 2$ examples in our dataset: $\{\dots,\ (x_2 = [-1, -1]^T,\ y_2 = 1)\}$. Work out on paper the process of training an $\ell_2$-regularized ($\lambda = 1$) linear regression model with batch gradient descent (learning rate $\eta = 1$) on this dataset for two epochs (steps).
(a) (1 pt) What is the value of the loss function at the beginning?
(b) (4 pts) What is the final state of the trained weight vector after 2 steps, and the corresponding value of the loss function? (Hint: derive the partial derivatives of the loss function with respect to the weights, and compute their values after each step.)
$$L = \frac{1}{N}\sum_{i=1}^{N}\left(y_i - w^T x_i\right)^2 + \|w\|_2^2$$
(c) (3 pts) Derive the formula for the closed-form solution of ridge regression. (Hint: first write out the loss function in matrix form.)
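For reference, below is a minimal NumPy sketch of the computations parts (a)-(c) ask for. Only the second example $(x_2 = [-1, -1]^T,\ y_2 = 1)$ is given above, so the first example in the sketch is a hypothetical placeholder; the loop simply mirrors the stated setup ($\lambda = 1$, learning rate 1, weights initialized to 0). It illustrates the procedure and is not the graded answer.

import numpy as np

# Placeholder dataset: x1/y1 are assumed values for illustration only;
# x2 = [-1, -1]^T with y2 = 1 comes from the question.
X = np.array([[ 1.0,  1.0],    # x1 (assumed)
              [-1.0, -1.0]])   # x2 (given)
y = np.array([0.0, 1.0])       # y1 assumed, y2 = 1 given

N = X.shape[0]    # number of examples (N = 2)
lam = 1.0         # l2 regularization strength (lambda = 1)
eta = 1.0         # learning rate
w = np.zeros(2)   # weights initialized to 0

def loss(w):
    # L = (1/N) * sum_i (y_i - w^T x_i)^2 + lambda * ||w||^2
    r = y - X @ w
    return np.mean(r ** 2) + lam * np.dot(w, w)

print("initial loss:", loss(w))          # part (a)

for step in range(2):                    # part (b): two batch GD steps
    # dL/dw = -(2/N) * X^T (y - X w) + 2 * lambda * w
    grad = -(2.0 / N) * X.T @ (y - X @ w) + 2.0 * lam * w
    w = w - eta * grad
    print(f"step {step + 1}: w = {w}, loss = {loss(w):.4f}")

# Part (c): setting the gradient of the matrix-form loss to zero gives
# (X^T X + N * lambda * I) w = X^T y.
w_closed = np.linalg.solve(X.T @ X + N * lam * np.eye(X.shape[1]), X.T @ y)
print("closed-form w:", w_closed)

For part (c), with the loss written in matrix form as $L(w) = \frac{1}{N}\|y - Xw\|_2^2 + \lambda\|w\|_2^2$, setting $\nabla_w L = 0$ yields $(X^T X + N\lambda I)\,w = X^T y$, i.e. $w = (X^T X + N\lambda I)^{-1} X^T y$ (with $N\lambda = 2$ for the values stated here); the exact constant in front of the identity depends on how the loss is normalized.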