Question: Let the training examples be $x^{(1)} = [1, 0.5, 3]^\top$, $x^{(2)} = [1, 0.2, 1]^\top$, with targets $y^{(1)} = 5$, $y^{(2)} = 2$.

1. Write the MSE loss function $J(\theta)$ in terms of the parameter vector $\theta$ and the training data above.
2. Derive the gradient (a 3-dimensional column vector) of $J(\theta)$ with respect to $\theta$. You need to derive the partial derivative of $J(\theta)$ with respect to each element of $\theta$ explicitly. Then write the gradient vector in the form of a matrix-vector product plus a constant vector. We call this the "vectorized" gradient, which is useful for gradient descent.
3. Use matrix calculus to derive the gradient of $J(\theta)$ with respect to $\theta$. The gradient must use the design matrix $X$, whose $i$-th row is $(x^{(i)})^\top$, $i = 1, 2$, and the target-value vector $y = [y^{(1)}, y^{(2)}]^\top$. Do not derive it from partial derivatives as in the previous part.
4. Use matrix calculus to find the gradient of the gradient $\nabla J(\theta)$. This is the so-called "second-order derivative," i.e., the Hessian matrix $H$. Show that this matrix is positive semidefinite (PSD), defined as $u^\top H u \ge 0$ for any vector $u \in \mathbb{R}^3$. Note: the result is a 3-by-3 matrix.
Step-by-Step Solution
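For part 1, a sketch under the common $\tfrac{1}{2m}$ convention with hypothesis $h_\theta(x) = \theta^\top x$ (some courses drop the $\tfrac{1}{2}$ or the $\tfrac{1}{m}$; adjust the constant to match your definition of MSE). With $m = 2$ examples:

\[
J(\theta) = \frac{1}{2m} \sum_{i=1}^{2} \left(\theta^\top x^{(i)} - y^{(i)}\right)^2
= \frac{1}{4}\left[(\theta_0 + 0.5\,\theta_1 + 3\,\theta_2 - 5)^2 + (\theta_0 + 0.2\,\theta_1 + \theta_2 - 2)^2\right].
\]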
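For part 2, a sketch of the explicit partial derivatives under the same convention. Writing $r^{(i)} = \theta^\top x^{(i)} - y^{(i)}$ for the $i$-th residual, the chain rule gives $\partial J / \partial \theta_j = \frac{1}{m}\sum_i r^{(i)} x_j^{(i)}$, so

\[
\frac{\partial J}{\partial \theta_0} = \frac{1}{2}\left(r^{(1)} + r^{(2)}\right), \quad
\frac{\partial J}{\partial \theta_1} = \frac{1}{2}\left(0.5\,r^{(1)} + 0.2\,r^{(2)}\right), \quad
\frac{\partial J}{\partial \theta_2} = \frac{1}{2}\left(3\,r^{(1)} + r^{(2)}\right).
\]

Stacking these and separating the $\theta$-dependent term from the constant term gives the requested matrix-vector-product-plus-constant form:

\[
\nabla J(\theta) = \frac{1}{m}\,X^\top X\,\theta - \frac{1}{m}\,X^\top y,
\qquad
X = \begin{bmatrix} 1 & 0.5 & 3 \\ 1 & 0.2 & 1 \end{bmatrix}, \quad
y = \begin{bmatrix} 5 \\ 2 \end{bmatrix}.
\]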
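For part 3, a matrix-calculus sketch using the standard identities $\nabla_\theta\,(\theta^\top A \theta) = (A + A^\top)\theta$ and $\nabla_\theta\,(b^\top \theta) = b$. Under the same convention, $J(\theta) = \frac{1}{2m}\lVert X\theta - y\rVert^2$, and expanding the quadratic:

\[
J(\theta) = \frac{1}{2m}\left(\theta^\top X^\top X\,\theta - 2\,y^\top X\,\theta + y^\top y\right)
\;\Longrightarrow\;
\nabla J(\theta) = \frac{1}{2m}\left(2\,X^\top X\,\theta - 2\,X^\top y\right) = \frac{1}{m}\,X^\top (X\theta - y),
\]

which matches the vectorized gradient from part 2.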

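For part 4, the gradient $\nabla J(\theta) = \frac{1}{m}X^\top X\,\theta - \frac{1}{m}X^\top y$ is affine in $\theta$, so differentiating once more leaves only the constant 3-by-3 matrix:

\[
H = \frac{1}{m}\,X^\top X,
\qquad
u^\top H u = \frac{1}{m}\,u^\top X^\top X\,u = \frac{1}{m}\,\lVert X u \rVert^2 \ge 0 \quad \text{for all } u \in \mathbb{R}^3,
\]

so $H$ is PSD. (With only $m = 2$ examples, $X$ has rank at most 2, so $H$ is singular: PSD but not positive definite.)

A minimal NumPy check of the formulas above; the $\tfrac{1}{2m}$ convention and the test point theta are illustrative assumptions, not part of the question:

import numpy as np

# Rows of the design matrix X are (x^(i))^T; y holds the targets from the question.
X = np.array([[1.0, 0.5, 3.0],
              [1.0, 0.2, 1.0]])
y = np.array([5.0, 2.0])
m = X.shape[0]

def loss(theta):
    # J(theta) = (1/(2m)) * ||X theta - y||^2  (1/(2m) convention assumed)
    r = X @ theta - y
    return (r @ r) / (2 * m)

def grad(theta):
    # Vectorized gradient: (1/m) X^T (X theta - y)
    return X.T @ (X @ theta - y) / m

H = X.T @ X / m  # Hessian is constant in theta

theta = np.array([0.1, -0.2, 0.3])  # arbitrary test point, chosen for illustration

# Central finite differences agree with the analytic gradient.
eps = 1e-6
fd = np.array([(loss(theta + eps * e) - loss(theta - eps * e)) / (2 * eps)
               for e in np.eye(3)])
assert np.allclose(grad(theta), fd, atol=1e-6)

# PSD check: eigenvalues of the symmetric matrix H are all >= 0
# (one is ~0 here, up to floating-point rounding, because X has rank 2 < 3).
print("eigenvalues of H:", np.linalg.eigvalsh(H))

The near-zero eigenvalue illustrates why $J$ is convex but its minimizer is not unique with only two examples: $H$ has a nontrivial null space, so an entire affine set of parameter vectors attains the minimum loss.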