2 Decision Tree (Theory) (Matthew) - 10 pts
In class, we primarily used information gain (the overall reduction in entropy) as the criterion for selecting
which feature to split on. Another method would be to use the Gini index. For multi-class, the Gini index
is calculated as follows, where $p_k$ represents the fraction of samples in class $k$ and $p$ represents the set of all probabilities $p_k$:
$$\text{Gini index}(p) = 1 - \sum_{k=1}^{K} p_k^2 .$$
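For example, for a single node containing the entire dataset described next (class counts 400, 500, and 300 out of $N = 1200$), the formula gives

$$1 - \left[\left(\tfrac{400}{1200}\right)^2 + \left(\tfrac{500}{1200}\right)^2 + \left(\tfrac{300}{1200}\right)^2\right] = 1 - \tfrac{50}{144} = \tfrac{94}{144} \approx 0.653.$$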
Consider a dataset comprising 1200 data points, with 400 data points from class C1, 500 data points
from class C2, and 300 data points from class C3. Suppose that decision tree model A splits the data into
three leaves. Assume that the label distribution is (300,50,50) at the first leaf, (50,400,50) at the second
leaf, and (50,50,200) at the third leaf, where (n1, n2, n3) denotes the number of points from C1, C2, and
C3, respectively. Similarly, suppose that decision tree model B splits the data into (300,0,100) at the first
leaf, (100,400,0) at the second leaf, and (0,100,200) at the third leaf. Answer the questions below,
showing all your work.
(a) Evaluate the misclassification rates for both decision trees and hence show that they are equal.
(b) Evaluate and compare the Weighted Gini index for the two trees. Which tree performs better in terms of
the Weighted Gini index? The Weighted Gini index is calculated as:
$$\text{Weighted Gini} = \sum_{j=1}^{J} \frac{n_j}{N}\, \text{Gini}(p_j)$$

where $\frac{n_j}{N}$ is the weight of node $j$, calculated as the proportion of the $n_j$ data points in node $j$ to the total number of data points $N$; $\text{Gini}(p_j)$ is the Gini index of leaf $j$; and $J$ is the total number of leaves in the tree.
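As a sanity check on the hand calculations, here is a minimal Python sketch that evaluates both criteria directly from the leaf counts given above (the function and variable names are illustrative, not part of the problem):

```python
def gini(counts):
    """Gini index of a leaf from its per-class sample counts."""
    n = sum(counts)
    return 1.0 - sum((c / n) ** 2 for c in counts)

def misclassification_rate(leaves):
    """Fraction of points not in their leaf's majority class."""
    total = sum(sum(leaf) for leaf in leaves)
    errors = sum(sum(leaf) - max(leaf) for leaf in leaves)
    return errors / total

def weighted_gini(leaves):
    """Leaf Gini indices weighted by leaf size n_j / N."""
    total = sum(sum(leaf) for leaf in leaves)
    return sum(sum(leaf) / total * gini(leaf) for leaf in leaves)

# Leaf label distributions (n1, n2, n3) from the problem statement.
tree_a = [(300, 50, 50), (50, 400, 50), (50, 50, 200)]
tree_b = [(300, 0, 100), (100, 400, 0), (0, 100, 200)]

for name, tree in [("A", tree_a), ("B", tree_b)]:
    print(f"Tree {name}: misclassification = {misclassification_rate(tree):.4f}, "
          f"weighted Gini = {weighted_gini(tree):.4f}")
```

Running this shows that the two trees tie on misclassification rate (part a) while their weighted Gini indices differ (part b), which is the contrast the question is driving at.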
