Question: can you help me with problem 1? Problem 1: (30 points) Consider the algorithm for building a CART model in the case of regression. Following

can you help me with problem 1?

Problem 1: (30 points) Consider the algorithm for building a CART model in the case of regression. Following and ex- panding on the notation from class, suppose that our current tree, denoted by Told, has |T01d| = M terminal nodes/ buckets. For each bucket m = 1,. .. , M , let: 1. Nm denote the number of observations in bucket m , 2. Qm (Told) denote the value of the impurity function at bucket m , and 3. Rm denote the region in the feature space corresponding to bucket m . Also let N be the overall total number of observations. Recall that, in the case of regression we have that: Qaaa=eraaf. #324612": " L . - - where ym Nm 23-3163"; y% is the mean response 111 bucket 111. Then the total impurity cost of the tree Told is dened as: M %Mm=2m%mm m=1 Consider a potential split at the nal bucket M (we're using M just for ease of notation), which results in a new tree Tnew. This new tree has |Tnew| = M + 1 terminal nodes/ buckets, and for this new tree we let 1. rm denote the number of observations in bucket m , 2. Qm (Tnew) denote the value of the impurity function at bucket m , and 3. Rm denote the region in the feature space corresponding to bucket m . The total impurity cost of the tree Tnew is defined analogously as: M+1 Cimp(Tnew) = _ NmQm(Inew) . m=1 Please answer the following: a) (10 points) Let A = Cimp(Told) - Cimp (Tnew) be the absolute decrease in total impurity resulting from the split. Derive a formula for A that can be computed locally at the bucket M, in other words it should only depend on the data points that fall in region RM in the original tree Told. (Hint: we've discussed this concept in class, this question is asking for a more formal argument. You may assume that the two new buckets in Thew resulting from the split are labeled as buckets M and M + 1 in Thew.) b) (10 points) Show that A 2 0, hence splitting always reduces the total impurity cost. (Hint: you can use the fact that, given a sequence of real numbers z1, Z2, ..., Zn, the mean z = " Et Zi is the minimizer of the function RSS(z) = Et-1( zi - z)2) c) (10 points) Let Rod be the training set R2 value for the model defined by Told, and likewise let Rnew be the training set R2 value for the model defined by Thew. Let SST = Zi-1(yi -y)2 be the total sum of squared errors, where y = , ET_, yi is the overall mean. For a given value of the complexity parameter (cp) a 2 0, recall the modified cost function that is relevant in the pruning step: Ca(T) = Cimp (T) + a . SST . ITI Show that Ca(Tnew)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!

Please answer all the question with a clear explanation, thank you for your work! Consider the algorithm for building a CART model in the case of regression. Following and ex- panding on the notation...

1 (Total points: 68) 1. This assignment is due by 8:30 am, October 7 (Wednesday). Please put it in my TA's assignment locker on 9/F of KKL Building. 2. You can do this assignment individually or in a...

Help tutors Problem 1. Shorter problems. (50 points) Solve the following shorter problems. 1. Compute the pure-strategy and mixed strategy equilibria of the following coordination game. Call u the...

I need some help with my work Problem 1. Shorter problems. (50 points) Solve the following shorter problems. 1. Compute the pure-strategy and mixed strategy equilibria of the following coordination...

You are to price options on a futures contract. The movements of the futures price are modeled by a binomial tree. You are given: (i) Each period is 6 months. (ii) u/d = 4/3, where u is one plus the...

Help on #7 Exercise 2: sorting algorithms, correctness and runtime (44 points). Consider an algorithm INSERTION-MERGE-SORT with the following INSERTION-MERGE-SORT(A, p,r) 1 ifp key A[i + 1]-6] 6 A[i...

Hi, I am stuck with these questions that are attached below. Could you please help me to solve them? Thank you so much! AutoSave ( OFF FXCostSp2020POST Q Q v Home Insert Draw Page Layout Formulas...

CASE 3 Building a Coalition Learning Goals Many of the most important organizational behavior challenges require coordinating plans and goals among groups. This case describes a multi-organizational...

Kindly help me to solve the questions as attached below. 3. To slow down transmissions of the viral disea 'lockdown" hence ordering business to shut and the citizens to adhere to limiting physical...

*Here is what AOL stands for* *Here is the POP algorithm as explained in class* Switch Room 3 Door 3 Corridor Switch 2 Door 2 Room 2 Switch Shakey Box 2 Door 1 Room 1 Figure above shows the Shakey's...

A long distance runner runs 15 laps of a 400 metres track every day. His timings (in min) for five consecutive days are 44, 43, 39, 38 and 36 respectively. On an average, how many km/hr does the...

Recast the Statement of Activities of the American Red Cross in the pancake format shown in Illustration. See accompanying notes to the consolidated financialstatements. The American National Red...

21. Why is it necessary to develop a definitional framework for the basic elements of accounting?

A company issues 1,050 shares of its common stock for $33,600 cash. Prepare journal entries to record this event under each of the following separate situations.