Question 4. Consider k-nearest-neighbor (k-NN) or Kernel Regression with a Gaussian Kernel. Show how to control...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Question 4. Consider k-nearest-neighbor (k-NN) or Kernel Regression with a Gaussian Kernel. Show how to control the bias-variance trade-off for either of the methods. That is, say you pick k-NN. Clearly, k is a paramter you control. Show, how the bias and variance changes as a result of changing k (similar paramter in kernel regression is h, see Q1.). Even though I am not looking for a formal proof, you need to be able to use the definition of bias and variance either on an example, by simulation or mathematically. You also need to be a able to give an intuitive explanation (something like, 'as we see here, as k increases bias .. because ...). Note I will accept any method's bias-variance behavior as long as you can show concretely that the behavior exists as a result of changing something in the method/model. 2. Bias-variance trade-off in the presence of multicollinearity. Let Y = X3 + e be a linear regression model in vector form. Here, Y, e € R", X is a known n xp predictor matrix, and 3 € RP is the parameter vector to be estimated. We assume E(e | X) = 0 and Var(e| X) = ³1. We showed in class that the least squares estimator is given by 3= arg min (Y-X 3)' (Y-X3) = arg min ||Y-XB| = (X'X)~¹X'Y, BERP BER which is the minimum variance unbiased estimator. Unbiasedness is a desirable property, but it may come at a cost of variance. Note that the estimator 3 and its variance both depend on (X'X)-¹. In- verting the matrix X'X is problematic if it is ill-conditioned, or in the regression setting, if the predictor variables exhibit multicollinearity (that is, if one pre- dictor variable can be linearly predicted by the others with a high degree of accuracy). One approach to alleviate this problem is to add an F-penalty to the sum of squares criterion. This is called ridge regression: 3= arg min (Y-XB)' (Y-X3) + AS8= arg min | Y - XB||+X|||| SER SER where A is a fixed nonnegative constant called the ridge parameter. (a) Show that 3x = (X'X+XI)-¹X'Y. Although X'X may be ill-conditioned, the matrix X'X+XI becomes increasingly well-conditioned as A increases. (b) Calculate E(3x | X) and Var(3x | X). What can you say about the bias- variance trade-off for the ridge estimator 3 compared to the least squares estimator 3? Question 4. Consider k-nearest-neighbor (k-NN) or Kernel Regression with a Gaussian Kernel. Show how to control the bias-variance trade-off for either of the methods. That is, say you pick k-NN. Clearly, k is a paramter you control. Show, how the bias and variance changes as a result of changing k (similar paramter in kernel regression is h, see Q1.). Even though I am not looking for a formal proof, you need to be able to use the definition of bias and variance either on an example, by simulation or mathematically. You also need to be a able to give an intuitive explanation (something like, 'as we see here, as k increases bias .. because ...). Note I will accept any method's bias-variance behavior as long as you can show concretely that the behavior exists as a result of changing something in the method/model. 2. Bias-variance trade-off in the presence of multicollinearity. Let Y = X3 + e be a linear regression model in vector form. Here, Y, e € R", X is a known n xp predictor matrix, and 3 € RP is the parameter vector to be estimated. We assume E(e | X) = 0 and Var(e| X) = ³1. We showed in class that the least squares estimator is given by 3= arg min (Y-X 3)' (Y-X3) = arg min ||Y-XB| = (X'X)~¹X'Y, BERP BER which is the minimum variance unbiased estimator. Unbiasedness is a desirable property, but it may come at a cost of variance. Note that the estimator 3 and its variance both depend on (X'X)-¹. In- verting the matrix X'X is problematic if it is ill-conditioned, or in the regression setting, if the predictor variables exhibit multicollinearity (that is, if one pre- dictor variable can be linearly predicted by the others with a high degree of accuracy). One approach to alleviate this problem is to add an F-penalty to the sum of squares criterion. This is called ridge regression: 3= arg min (Y-XB)' (Y-X3) + AS8= arg min | Y - XB||+X|||| SER SER where A is a fixed nonnegative constant called the ridge parameter. (a) Show that 3x = (X'X+XI)-¹X'Y. Although X'X may be ill-conditioned, the matrix X'X+XI becomes increasingly well-conditioned as A increases. (b) Calculate E(3x | X) and Var(3x | X). What can you say about the bias- variance trade-off for the ridge estimator 3 compared to the least squares estimator 3?
Expert Answer:
Answer rating: 100% (QA)
Here is an explanation of controlling the biasvariance tradeoff in kNN and kernel regression as well ... View the full answer
Related Book For
Auditing and Assurance Services Understanding the Integrated Audit
ISBN: 978-0471726340
1st edition
Authors: Karen L. Hooks
Posted Date:
Students also viewed these finance questions
-
Management has asked you to provide a "snapshot" of its current sales and analyze the potential impacts of the economic contraction and the looming union settlement on Hera's earnings. What follows...
-
Design a Java class that represents a cache with a fixed size. It should support operations like add, retrieve, and remove, and it should evict the least recently used item when it reaches capacity.
-
Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...
-
Drag and drop the protocols from the bank to their respective layer on the OSI and TCP/IP Model. Not every layer will have protocols associated with it. Protocols TCP/IP Model OSI Model Application...
-
Suppose that it takes tensile forces of magnitude F to produce a given strain L / L in a steel wire of cross sectional area A. If you had two such wires side by side and stretched them...
-
Pillow block bearing Grinding wheel 60 lb R 20 in. a Steel shaft, E 29 E6 psi diameter = ? 12 in. b Find the diameter of the above circular shaft, that experiences a deflection of 0.00174 inches at...
-
Jenna Smith recently purchased an annuity contract that will pay her \(\$ 375,000\) per year for the next seven years. According to Smith's calculations, the estimated internal rate of return on this...
-
Spoilage in job costing. Whitefish Machine Shop is a manufacturer of motorized carts for vacation resorts. Peter Cruz, the plant manager of Whitefish, obtains the following information for Job #10 in...
-
In a negotiation process it is best to build a bridge by starting from your own position, otherwise you will not get what you want. Review Later True False
-
Name the fallacy in each statement below and explain why it is fallacious. 1. The best restaurant in New York city is either buddy's bistro or clyde's emporium. 2. Everyone is going to buddy's...
-
The code of a program as follows. #include #include #include #define SIZE 5 int nums[SIZE] = [0,1,2,3,4}; int main() { int i; pid t pid; pid= fork(); if (pid == 0) { for (i = 0; i < SIZE; i++) {...
-
A race car driving on a banked track that makes an angle \(\theta\) with the horizontal rounds a curve for which the radius of curvature is R. (a) As described in Problem 12, there is one speed...
-
A \(0.200-\mathrm{kg}\) small block is attached to an ideal spring with a spring constant of \(316 \mathrm{~N} / \mathrm{m}\) and is moving in simple harmonic motion on a horizontal frictionless...
-
A \(5.0-\mathrm{kg}\) cart is attached to a horizontal spring for which the spring constant is \(75 \mathrm{~N} / \mathrm{m}\). The system is set in motion when the cart is \(0.32 \mathrm{~m}\) from...
-
Sinusoidal water waves are generated by a 18 -W mechanical motor in an aquarium tank. How much the kinetic energy is in each \(1.25-\mathrm{Hz}\) wave generated? \(\cdot\)
-
In a particle accelerator, the particles in a beam of protons are traveling toward a target at a speed equal to \(0.100 c_{0}\). If the beam has a radius of \(0.100 \mu \mathrm{m}\) and carries a...
-
Consider a random variable X and its (partially known) p.m.f. px (x) = ? 1/2 ? 1/10 0 if x = 0 if x = 1 if x = 2 if x = 3 if x # {0, 1,2,3} Assume E(X)=0.8. Find the variance of X.
-
Why is homeostasis defined as the "relative constancy of the internal environments? Does negative feedback or positive feedback tend to promote homeostasis?
-
What is the auditor ultimately trying to determine regarding material related party transactions and the financial statements?
-
What reasons have been given for expanding auditor liability for negligence to foreseen users?
-
What is the difference between the business of a wholesale distributor and that of a retailer?
-
Refer to the data given for the Lowe Company in Problem P12-2B. Required a. Compute the change in cash that occurred during 2019. b. Prepare a statement of cash flows using the direct method. Problem...
-
Refer to the data given for the Madison Company in Problem P12-3B. Required a. Compute the change in cash that occurred during 2019. b. Prepare a statement of cash flows using the direct method. Use...
-
The financial statements for the Columbia Sportswear Company can be found in Appendix A at the end of this book. Required Answer the following questions: a. How much did Columbia Sportswear's cash...
Study smarter with the SolutionInn App