Question: You have a decision tree algorithm and you are trying to figure out which attribute is the best to test on first. You are using

You have a decision tree algorithm and you are trying to figure out which attribute is the best to test on first. You are using the information gain metric.

You are given a set of 128 examples, with 64 positively labeled and 64 negatively labeled.

There are three attributes: Homeowner (H), In Debt (ID), and Rich (R).

For 64 examples, Home Owner is true. The Homeowner=true examples are 1/4 negative and 3/4 positive.

For 96 examples, In Debt is true. Of the In Debt=true examples, 1/2 are positive and half are negative.

For 32 examples, Rich is true. 3/4 of the Rich=true examples are positive and 1/4 are negative

You must show all mathematical calculations/steps to get full points for each subpart (a) (d) below. Just writing the final answer in each subpart (correct or not) will get zero points.

a)What is the entropy of the initial set of examples?

b) What is the information gain of splitting on the Home Owner attribute as the root node?

c)What is the information gain of splitting on the In Debt attribute as the root node?

d) What is the information gain of splitting on the Rich attribute as the root node?

e) Which attribute do you split on?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Our textbook describes in its Section 8.2.1 a basic algorithm for inducing a decision tree from training tuples. The algorithm follows a top down approach, similar to how decision trees are...

Decision Trees ( DTs ) are a non - parametric supervised learning method used for classification and regression. The goal is to create a model that predicts the value of a target variable by learning...

Classification is a form of data analysis where a model or classifier is constructed to predict class labels or types. Data classification is a two-phase process, 1) a learning phase in which the...

Problem Our textbook describes in its Section 8.2.1 a basic algorithm for inducing a decision tree from training tuples. The algorithm follows a top down approach, similar to how decision trees are...

Problem A basic algorithm, for inducing a decision tree from training tuples, follows a top down approach, similar to those constructed by algorithms such as ID3, C4.5 and CART. Part 1 The execution...

Classification is a form of data analysis where a model or classifier is constructed to predict class labels or types. Data classification is a two-phase process, 1) a learning phase in which the...

A classification is a form of data analysis where a model or classifier is constructed to predict class labels or types. Data classification is a two-phase process, 1) a learning phase in which the...

I need some serious help with this decision tree algorithm for my machine learning class. Be sure to read the code requirements below, because i need to write some parts from scratch. Any and all...

can someone please help me with the following java question please Algorithm Description: .The algorithm is called with three parameters: D, attribute list, and Attribute selection method. We refer...

Below at the bottom of the question is supposed to be a Java program that implements the basic algorithm for inducing a decision tree. With the requirements: You are not required to fully implement...

Energies of a spherical quantum dot (a) Derive the formula (63) for the charging energy. (b) Show that, for d

A growing number of organizations are using cloud computing as a viable alternative for their IT resource needs. Cloud computing allows organizations to increase their ability to meet computing...

A key concern in any merger or acquisition is the legality of the purchase. True False Clear selection

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

Assume that the banking system initially has no excess reserves and that the reserve requirement is 10 percent. Also assume that velocity is constant and that the economy initially is operating at...

KEY QUESTION Graph the accompanying demand data, and then use the midpoint formula for E d to determine price elasticity of demand for each of the four possible $1 price changes. What can you...

LAST WORD Compare and contrast the Taylor rule for monetary policy with the older, simpler monetary rule advocated by Milton Friedman.