(b) A decision tree is a straightforward algorithm that is easy to understand for classification problems. Entropy is a measure of purity, and it can be used to calculate the expected information of an attribute (Fig. 3b).
Entropy:
E(S) = -\sum_{i=1}^{c} p_i \log_2 p_i
where S is the training set, p_i is the proportion of examples in S belonging to class i, and c is the number of class labels.
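As a quick sketch, the entropy formula above can be computed directly from the class counts; the function name and the example counts below are illustrative, not taken from Table 3:

```python
import math

def entropy(class_counts):
    """Entropy E(S) of a set, given the number of examples in each class."""
    total = sum(class_counts)
    e = 0.0
    for n in class_counts:
        if n == 0:
            continue  # 0 * log2(0) is treated as 0
        p = n / total  # p_i: proportion of examples in class i
        e -= p * math.log2(p)
    return e

# Example: a set with 9 examples of one class and 5 of the other
print(round(entropy([9, 5]), 4))  # -> 0.9403
```

A pure set (all examples in one class) gives entropy 0, and an evenly split two-class set gives the maximum entropy of 1.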
Expected information of an attribute:
E(T, x) = \sum_{v \in \mathrm{values}(x)} P(v)\, E(S_v)
where T is the training set, x is the attribute, v ranges over the labels (values) of attribute x, P(v) is the proportion of examples in T for which x takes value v, and E(S_v) is the entropy of the subset of examples with that value.
Fig. 3b
Information gain can determine which attribute is the better splitting node by subtracting the expected information after splitting from the entropy before splitting, i.e. info_gain(T, x) = E(T) - E(T, x).
Based on the dataset in Table 3, determine the information gain of each input
attribute and suggest which input attribute should be the root node of a decision tree.
Show your steps clearly.
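Table 3 is not reproduced here, so the procedure can only be sketched on a small hypothetical dataset; the attribute, its values, and the class labels below are invented for illustration, and the same steps would apply to each input attribute of Table 3:

```python
import math
from collections import Counter

def entropy(labels):
    """E(S) over a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())

def info_gain(rows, labels, attr_index):
    """info_gain(T, x) = E(T) - E(T, x): entropy before splitting minus
    the expected (weighted) entropy after splitting on one attribute."""
    total = len(labels)
    # Partition the class labels by the attribute's value v
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attr_index], []).append(label)
    # E(T, x) = sum over values v of P(v) * E(S_v)
    expected = sum(len(part) / total * entropy(part)
                   for part in partitions.values())
    return entropy(labels) - expected

# Hypothetical dataset: one attribute ("Outlook") and a yes/no class
rows = [("sunny",), ("sunny",), ("overcast",), ("rain",), ("rain",)]
labels = ["no", "no", "yes", "yes", "no"]
print(round(info_gain(rows, labels, 0), 4))  # -> 0.571
```

Computing info_gain for every input attribute and picking the attribute with the largest gain gives the root node of the decision tree.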