Question: Problem 1 (SUBOPTIMALITY OF ID3 FOR DECISION TREES)

Consider the following training set, where X = {0,1}^3 and Y = {0,1}:

  ((1,1,1), 1)
  ((1,0,0), 1)
  ((1,1,0), 0)
  ((0,0,1), 0)

Suppose we wish to use this training set to build a decision tree of depth 2 (i.e., for each input we are allowed to ask two questions of the form "P_i = 0?" before deciding on the label).

(a) Suppose we run the ID3 algorithm up to depth 2 (namely, we pick the root node and its children according to the algorithm, but instead of continuing with the recursion, we stop and pick leaves according to the majority label in each subtree). Assume that the subroutine used to measure the quality of each feature is based on the entropy function (so we measure the information gain), and that if two features get the same score, one of them is picked arbitrarily. Show that the training error of the resulting decision tree is at least 1/4.

(b) Find a decision tree of depth 2 that attains zero training error.
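As a sanity check on both parts, here is a minimal Python sketch (the data layout and the helper names `entropy`, `information_gain`, and `xnor_tree` are our own choices, not from any library) that computes the information gains ID3 would see and evaluates a candidate zero-error tree:

```python
import math

# The four training examples from the problem: X = {0,1}^3, Y = {0,1}.
DATA = [((1, 1, 1), 1),
        ((1, 0, 0), 1),
        ((1, 1, 0), 0),
        ((0, 0, 1), 0)]

def entropy(labels):
    """Binary entropy of a list of 0/1 labels (0.0 for an empty or pure list)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def information_gain(examples, i):
    """Entropy reduction from splitting `examples` on coordinate P_{i+1}."""
    labels = [y for _, y in examples]
    gain = entropy(labels)
    for v in (0, 1):
        branch = [y for x, y in examples if x[i] == v]
        gain -= len(branch) / len(examples) * entropy(branch)
    return gain

# (a) Gains at the root: only P1 has positive gain, so ID3 must pick it.
for i in range(3):
    print(f"root gain(P{i + 1}) = {information_gain(DATA, i):.3f}")
# root gain(P1) = 0.311, root gain(P2) = 0.000, root gain(P3) = 0.000

# Under P1 = 1 three examples remain (labels 1, 1, 0).  P2 and P3 tie,
# and either split leaves one child holding two examples with opposite
# labels, so a majority-vote leaf misclassifies one of the four points.
p1_branch = [(x, y) for x, y in DATA if x[0] == 1]
for i in (1, 2):
    print(f"gain(P{i + 1} | P1=1) = {information_gain(p1_branch, i):.3f}")
# gain(P2 | P1=1) = gain(P3 | P1=1) = 0.252

# (b) A depth-2 tree with zero training error: root on P2, ask P3 in both
# subtrees, and predict 1 exactly when P2 == P3.
def xnor_tree(x):
    return 1 if x[1] == x[2] else 0

print("errors of the P2/P3 tree:", sum(xnor_tree(x) != y for x, y in DATA))  # 0
```

For (a), since only P1 has positive gain at the root, ID3 is forced to split on it; under P1 = 1, either second question leaves a child with one positive and one negative example, so the resulting majority-vote leaf errs on one of the four training points and the training error is at least 1/4. For (b), the labels in this set happen to equal the XNOR of the last two coordinates, so the tree that asks about P2 and then P3 fits all four examples, even though neither coordinate has positive gain on its own.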
