Question: Please help me solve this KeyError raised while building a decision tree (full traceback and code below).

Please solve this ERROR!!!!!!!
KeyError Traceback (most recent call last)
in ()
90
91 # Build the decision tree
--->92 decision_tree = build_decision_tree(df)
93
94 # Print the decision tree
1 frames
in build_decision_tree(data, tree)
78 # Set the outcome for the current branch
79 if len(counts)==1:
--->80 tree[best_split_attribute][value]= outcomes[0]
81 else:
82 tree[best_split_attribute][value]= outcomes[index]
KeyError: 'House Type'
I checked my CSV file, and the 'House Type' column is definitely present in it.
============================
import pandas as pd
import numpy as np
# Load dataset into a pandas dataframe
# NOTE(review): expects 'dataset.csv' in the working directory with an
# 'Outcome' target column plus categorical attribute columns — confirm.
df = pd.read_csv('dataset.csv')
# Define a function to calculate entropy
def entropy(target_col):
elements, counts = np.unique(target_col, return_counts=True)
probs = counts/len(target_col)
entropy = np.sum(-probs * np.log2(probs))
return entropy
# Define a function to calculate information gain
def info_gain(data, split_attribute_name, target_name="Outcome"):
# Calculate the entropy of the entire dataset
total_entropy = entropy(data[target_name])
# Calculate the values and corresponding counts for the split attribute
vals, counts = np.unique(data[split_attribute_name], return_counts=True)
# Calculate the weighted entropy of the split data
weighted_entropy = np.sum([(counts[i]/np.sum(counts))* entropy(data.where(data[split_attribute_name]==vals[i]).dropna()[target_name]) for i in range(len(vals))])
# Calculate the information gain
info_gain = total_entropy - weighted_entropy
return info_gain
# Define a function to get the best split attribute
def get_best_split(data):
# Get the list of column names
columns = list(data.columns)
# Remove the target column name
columns.remove('Outcome')
# Calculate the information gain for each column
info_gains =[info_gain(data, column) for column in columns]
# Get the index of the column with the highest information gain
best_column_index = np.argmax(info_gains)
# Return the name of the best split attribute
return columns[best_column_index]
# Define the decision tree building function
def build_decision_tree(data, tree=None):
# Get the best split attribute
best_split_attribute = get_best_split(data)
# Get the unique values for the best split attribute
values = np.unique(data[best_split_attribute])
# Create a new tree node with the best split attribute
if tree is None:
tree ={}
tree[best_split_attribute]={}
# For each value of the best split attribute, create a new branch
for value in values:
# Create a new branch for the current value
sub_data = data.where(data[best_split_attribute]== value).dropna()
# Get the most common outcome for the current branch
outcomes, counts = np.unique(sub_data['Outcome'], return_counts=True)
index = np.argmax(counts)
# Set the outcome for the current branch
if len(counts)==1:
tree[best_split_attribute][value]= outcomes[0]
else:
tree[best_split_attribute][value]= outcomes[index]
# Recursively build the subtree for the current branch
if len(sub_data.drop(columns=[best_split_attribute, 'Outcome']))>0:
subtree = build_decision_tree(sub_data.drop(columns=[best_split_attribute]),{})
tree[best_split_attribute][value]= subtree
# Return the tree
return tree
# Build the decision tree from the full training dataframe.
decision_tree = build_decision_tree(df)
# Print the decision tree — in a notebook the bare expression below
# displays the nested dict as the cell's output.
decision_tree

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!