In this problem, we consider splitting when building a regression tree in the CART algorithm. We...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
In this problem, we consider splitting when building a regression tree in the CART algorithm. We assume that there is a feature vector X RP and dependent variable Ye R. We have collected a training dataset (x, y),..., (In, Yn), where x R and y; E R for all i = 1, ..., n. We also assume, for simplicity, that we are considering the initial split at the top (root node) of the tree. An arbitrary split simply divides the training dataset into a partition of size two. By appropriately reshuffling the data, we can represent this partition (again for simplicity) via two sub-datasets (x1, y),..., (TN, YN) and (TN+1, YN+1),..., (En, Yn) where N is the index of the last observation included in the first set. Assume throughout that our impurity function is the RSS error the standard choice for a regression tree. e) (10 points) Consider a modification of the regression tree algorithm such that, in addition to considering splits of the form described in the paragraph preceding part (d), we also consider splits of the form R(j,l,t) = {X : XjX < t} and R(j,l,t) = {X : XjX t} where j and e are the indices of two chosen features and t is a cutoff value for XjXe. Is it possible for these new splits to improve the regression tree? Explain. In this problem, we consider splitting when building a regression tree in the CART algorithm. We assume that there is a feature vector X RP and dependent variable Ye R. We have collected a training dataset (x, y),..., (In, Yn), where x R and y; E R for all i = 1, ..., n. We also assume, for simplicity, that we are considering the initial split at the top (root node) of the tree. An arbitrary split simply divides the training dataset into a partition of size two. By appropriately reshuffling the data, we can represent this partition (again for simplicity) via two sub-datasets (x1, y),..., (TN, YN) and (TN+1, YN+1),..., (En, Yn) where N is the index of the last observation included in the first set. Assume throughout that our impurity function is the RSS error the standard choice for a regression tree. e) (10 points) Consider a modification of the regression tree algorithm such that, in addition to considering splits of the form described in the paragraph preceding part (d), we also consider splits of the form R(j,l,t) = {X : XjX < t} and R(j,l,t) = {X : XjX t} where j and e are the indices of two chosen features and t is a cutoff value for XjXe. Is it possible for these new splits to improve the regression tree? Explain.
Expert Answer:
Answer rating: 100% (QA)
In the context of regression trees and the CART Classification and Regression Trees algorithm the primary goal is to find optimal splits that minimize ... View the full answer
Related Book For
Business Intelligence And Analytics Systems For Decision Support
ISBN: 9781292009209
10th Global Edition
Authors: Efraim Turban, Ramesh Sharda, Dursun Delen, Pearson Education Limited, Dennis G. Zill
Posted Date:
Students also viewed these programming questions
-
3. Please explain in detail each step and coding characters of below Python code. (10 points) #Declare variables to store the budget amount, # amount spent, difference, and total. budget = 0.0...
-
Calculate the internal rate of return on this investment An investment project has the following cash flows: _________________________________________________________________________ year 0...
-
Part of the auditor's unmodified (also known as clean) opinion states that the "financial statements present fairly the financial position, results of operations, and cash flows of the company"....
-
Computer Technologies provides maintenance service for computers and office equipment for companies throughout the Northeast. The sales manager is elated because she closed a $300,000 three-year...
-
For those who think electric cars are sissy, Keio University in Japan has tested a 22-ft long prototype whose eight electric motors generate a total of 590 horsepower. The Kaz cruises at 180 mi/h...
-
Find the equation of the plane passing through the origin and parallel to a. The xy-plane b. The plane x + y + z = 1
-
Bustamante & Sons (B&S), Inc. publishes a small line of textbooks for introductory undergraduate business courses. Deron Ackerman, Director of Sales for B&S, is now performing annual reviews of the...
-
Global Systems manufactures an optical switch that it uses in its final product. Global Systems incurred the following manufacturing costs when it produced 69,000 units last year: Globle system does...
-
The following information relate to the business of Katwishi as at 31-12-16 K'000 Purchases Sales Returns inwards Returns outwards Debtors 64,700 125,600 6,340 1,900 11,250 Creditors 7,900...
-
FIGURE CP7.58 shows three hanging masses connected by massless strings over two massless, frictionless pulleys. (a) Find the acceleration constraint for this system. It is a single equation relating...
-
Locate the 2 0 0 5 Supreme Court decision involving Arthur Andersen's appeal of its criminal conviction of destroying records. What is the Supreme Court's opinion with respect to record retention?...
-
A benefit of issuing a corporate bond for the corporation is The initial capital raised does not have to be repaid Bonds issued by a corporation are an asset of the business The interest paid on the...
-
Discussion of Lean Six Sigma Framework (For Consulting Project Only) and what types of research it is best suited to study is cussion of Lean Six Sigma Framework (For Consulting Project Only) and...
-
On May 31, 2020, the Corporation gave Bob Jones, a member of the board of directors who also has provided consulting services to the Corporation from time to time, an incentive stock option to...
-
When a balance sheet date falls between the date of a foreign currency denominated export sale and the date cash is collected on the foreign currency account receivable, the foreign currency account...
-
For your initial Discussion Board, we will explore the advantages of managing people well. For this discussion, select and respond to two (2) of the questions below: Select a manager from your life...
-
Expand the quotient by partial fractions. 3x+28 (x+1)(x+6) O a Ob Oc Od -5 2 X+1 X+6 5 2 X+1 X+6 5 X+1 25 X+1 + -2 X+6 10 X+6
-
After graduating from college and working a few years at a small technology firm. Preet scored a high-level job in the logistics department at Amex Corporation. Amex sells high-quality electronic...
-
What is PageRank algorithm? What is the relation between PageRank and citation analysis? How does Google use PageRank?
-
What is a spreadsheet add-in? How can add-ins help in DSS creation and use?
-
The terrorist attack on the World Trade Center on September 11, 2001, underlined the importance of open source intelligence. The USA PATRIOT Act and the creation of the U.S. Department of Homeland...
-
For the composite properties and environmental conditions described in Examples 3.6, 4.7, and 5.3, determine the hygrothermally degraded values of the longitudinal and transverse tensile strengths....
-
The filament-wound E-glass/epoxy pressure vessel described in Example 4.4 is to be used in a hot-wet environment with temperature \(T=100^{\circ} \mathrm{F}\) \(\left(38^{\circ} \mathrm{C} ight)\)...
-
A carbon/epoxy lamina is clamped between rigid plates in a mold (Figure 5.17), while curing at a temperature of \(125^{\circ} \mathrm{C}\). After curing, the lamina/mold assembly (still clamped...
Study smarter with the SolutionInn App