Question: 2. Regression with Radial Basis Functions, 70 points

In the previous case, we considered a nonlinear extension to linear regression using a linear combination of polynomial basis functions, where each basis function was introduced as a feature $\phi(x) = x^k$.

Now, we consider Gaussian radial basis functions of the form

$$\phi(x) = e^{-\gamma (x - \mu)^2},$$

whose shape is defined by its center $\mu$ and its width $\gamma > 0$. In the case of polynomial basis regression, the user's choice of the dimension $d$ determined the transformation and the model. For radial basis regression, we have to contend with deciding how many radial basis functions we should have, and what their center and width parameters should be. For simplicity, let's assume that $\gamma = 0.1$ is fixed. Instead of trying to identify the number of radial basis functions or their centers, we can treat **each data point as the center of a radial basis function**, which means that the model will be:

$$f(x) = \begin{bmatrix} w_1 & w_2 & w_3 & \cdots & w_n \end{bmatrix} \begin{bmatrix} e^{-\gamma (x - x_1)^2} \\ e^{-\gamma (x - x_2)^2} \\ \vdots \\ e^{-\gamma (x - x_n)^2} \end{bmatrix} = \sum_{i=1}^{n} w_i \, e^{-\gamma (x - x_i)^2}$$

This transformation uses radial basis functions $e^{-\gamma (x - x_i)^2}$ centered around the data points, and each basis function has a corresponding weight $w_i$ associated with it, for all $i = 1, \dots, n$. We transform each univariate data point $x_i$ into a multivariate ($n$-dimensional) data point via $\phi(x_i) = [\dots, e^{-\gamma (x_i - x_j)^2}, \dots]$ for $j = 1, \dots, n$. When this transformation is applied to every data point, it produces the radial-basis kernel:

$$\Phi = \begin{bmatrix}
1 & e^{-\gamma (x_1 - x_2)^2} & e^{-\gamma (x_1 - x_3)^2} & \cdots & e^{-\gamma (x_1 - x_n)^2} \\
e^{-\gamma (x_2 - x_1)^2} & 1 & e^{-\gamma (x_2 - x_3)^2} & \cdots & e^{-\gamma (x_2 - x_n)^2} \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
e^{-\gamma (x_n - x_1)^2} & e^{-\gamma (x_n - x_2)^2} & e^{-\gamma (x_n - x_3)^2} & \cdots & 1
\end{bmatrix}$$

a. (15 points) Complete the Python function below that takes univariate data as input and computes a radial-basis kernel. This transforms one-dimensional data into $n$-dimensional data in terms of Gaussian radial-basis functions centered at each data point, and allows us to model nonlinear (kernel) regression.

In [ ]:

```python
# X     float(n, ): univariate data
# B     float(n, ): basis functions
# gamma float     : standard deviation / scaling of radial basis kernel
def radial_basis_transform(X, B, gamma=0.1):
    #
    # *** Insert your code here ***
    #
```

b. (15 points) Complete the Python function below that takes a radial-basis kernel matrix $\Phi$, the labels $y$, and a regularization parameter $\lambda > 0$ as input and learns weights via ridge regression. Specifically, given a radial-basis kernel matrix $\Phi$, implement the computation of

$$w = (\Phi^{\top} \Phi + \lambda I_n)^{-1} \Phi^{\top} y$$

In [ ]:

```python
# Phi float(n, d): transformed data
# y   float(n, ): labels
# lam float      : regularization parameter
def train_ridge_model(Phi, y, lam):
    #
    # *** Insert your code here ***
    #
```

c. (30 points) As before, we can explore the tradeoff between fit and complexity by varying $\lambda \in [10^{-3}, 10^{-2}, \dots, 1, \dots, 10^{3}]$. For each model, train using the transformed training data ($\Phi$) and evaluate its performance on the transformed validation and test data. Plot two curves: (i) $\lambda$ vs. validation error and (ii) $\lambda$ vs. test error, as above. What are some ideal values of $\lambda$?

In [ ]:

```python
#
# *** Insert your code here ***
#
```

d. (10 points, Discussion) Plot the learned models as well as the true model, similar to the polynomial basis case above. How does the linearity of the model change with $\lambda$?

In [ ]:

```python
#
# *** Insert your code here ***
#
```
Step by Step Solution
There are three steps involved: (1) compute the radial-basis kernel transform (part a), (2) learn the ridge-regression weights (part b), and (3) sweep the regularization parameter $\lambda$, plot the validation and test errors, and plot the learned models against the true model (parts c and d). One possible solution is sketched below, step by step.
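**Step 1 (part a).** A minimal sketch of the kernel transform, assuming NumPy and that `X` and `B` are one-dimensional arrays, with `B` holding the basis-function centers (here, the training points themselves). Broadcasting a column vector against a row vector yields all pairwise squared differences at once.

```python
import numpy as np

# X     float(n, ): univariate data
# B     float(m, ): centers of the radial basis functions
# gamma float     : scaling of the radial basis kernel
def radial_basis_transform(X, B, gamma=0.1):
    # Phi[i, j] = exp(-gamma * (X[i] - B[j])^2); the (n, 1) vs. (1, m)
    # broadcast produces the full (n, m) matrix of pairwise differences.
    X = np.asarray(X, dtype=float)
    B = np.asarray(B, dtype=float)
    return np.exp(-gamma * (X.reshape(-1, 1) - B.reshape(1, -1)) ** 2)
```

With `B` equal to the training points, the diagonal entries are $e^{0} = 1$, matching the kernel matrix shown in the question.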
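**Step 2 (part b).** A sketch of the ridge-regression computation $w = (\Phi^{\top} \Phi + \lambda I_n)^{-1} \Phi^{\top} y$ from the question. `np.linalg.solve` is used instead of forming the inverse explicitly; this is mathematically equivalent but numerically preferable.

```python
import numpy as np

# Phi float(n, d): transformed data
# y   float(n, ): labels
# lam float      : regularization parameter
def train_ridge_model(Phi, y, lam):
    d = Phi.shape[1]
    # Solve (Phi^T Phi + lam * I) w = Phi^T y rather than inverting.
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(d), Phi.T @ y)
```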

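**Step 3 (parts c and d).** A sketch of the $\lambda$ sweep and the plots. The split variables `X_trn, y_trn, X_val, y_val, X_tst, y_tst` and the true function `f_true` are hypothetical placeholders for whatever the earlier parts of the assignment define; mean squared error is assumed as the error metric.

```python
import numpy as np
import matplotlib.pyplot as plt

lambdas = np.logspace(-3, 3, num=7)  # 10^-3, 10^-2, ..., 1, ..., 10^3

# Transform every split using the training points as centers.
Phi_trn = radial_basis_transform(X_trn, X_trn)
Phi_val = radial_basis_transform(X_val, X_trn)
Phi_tst = radial_basis_transform(X_tst, X_trn)

val_err, tst_err, models = [], [], {}
for lam in lambdas:
    w = train_ridge_model(Phi_trn, y_trn, lam)
    models[lam] = w
    val_err.append(np.mean((Phi_val @ w - y_val) ** 2))  # validation MSE
    tst_err.append(np.mean((Phi_tst @ w - y_tst) ** 2))  # test MSE

# (i) lambda vs. validation error, (ii) lambda vs. test error
plt.figure()
plt.semilogx(lambdas, val_err, marker='o', label='validation error')
plt.semilogx(lambdas, tst_err, marker='s', label='test error')
plt.xlabel('lambda'); plt.ylabel('mean squared error'); plt.legend()
plt.show()

# Part d: learned models vs. the true model on a dense grid.
x_grid = np.linspace(X_trn.min(), X_trn.max(), 500)
Phi_grid = radial_basis_transform(x_grid, X_trn)
plt.figure()
plt.plot(x_grid, f_true(x_grid), 'k--', label='true model')  # f_true: placeholder
for lam in lambdas:
    plt.plot(x_grid, Phi_grid @ models[lam], label=f'lambda = {lam:g}')
plt.xlabel('x'); plt.ylabel('f(x)'); plt.legend()
plt.show()
```

For the discussion: small $\lambda$ allows large weights, so the fit follows the radial bumps closely and looks highly nonlinear (and can overfit); large $\lambda$ shrinks the weights toward zero, so the curve becomes smoother and flatter. Ideal values of $\lambda$ are those minimizing validation error, typically in the middle of the sweep.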