Question: Please help fast urgent. Please asnwer BOTH QUESTIONS in python. Dataset 1 has four columns that are called Mood, Effort, Score, Output. The mood is
Please help fast urgent. Please asnwer BOTH QUESTIONS in python. Dataset has four columns that are called Mood, Effort, Score, Output. The mood is either Happy, Neutral or Sad. The effort is either High, Medium, or Low. The Score is numbers between The output has only two values that are yes or no There are total of rows. Dataset has six columns called Age, Sex, BP Cholestrol, NatoK Output. The age has values between The Sex is either F or M BP is either High, Normal, or Low. Cholestrol is either High, or Normal. Natok has float values from The output is either drugA, drugB, drugC, drugX, drugY. It has rows. Please help fast urgent.Problem : marks
Inspect the dataset titled labdatasetcsv which has a mixture of numerical and
categorical data. Your task will be to write a function myID which can create a decision
tree for the given dataset using the ID algorithm. However, before doing that, you will be
have to perform some data processing tasks. Here are all the required tasks in order
ID cannot handle continuous numerical data. Perform necessary operations to
handle all continuousvalued attributes. Do not forget to show the output ie the
updated dataset after handling continuousvalued attributes. marks
Next, you will have to ensure the newly obtained dataset is optimal and free of
errors. Take appropriate actions based on the outcomes.
a Check if the dataset has any missing values. mark
b Check if the dataset has any redundant or repeated input sample. mark
c Check if the dataset has any contradicting YProblem : marks
Inspect the dataset titled labdatasetcsv which also has a mixture of numerical and
categorical data. For this problem, you will use decision tree classifiers for supervised
learning. In particular, you will be using the functionalities of the sklearn.tree library. The
classification task using sklearn libraries work only on numericalvalued attributes, and not
on categorical ones. What to do now? Hint: Look up Onehot Encoding and Integer
Encoding Here are all the required tasks
Restructure the dataset such that it has all numericalvalued attributes. marks
Perform supervised learning using decision tree classifiers. Employ the traintest
split approach during the learning. marks
After the learning is complete, show the results by predicting the class of the test
set. Display the results of the prediction and test set sidebyside. marks
Output the decision tree; it can be either a textual representation or a graphical
representation. marksProblem : marks
Inspect the dataset titled labdatasetcsv which has a mixture of numerical and
categorical data. Your task will be to write a function myID which can create a decision
tree for the given dataset using the ID algorithm. However, before doing that, you will be
have to perform some data processing tasks. Here are all the required tasks in order
ID cannot handle continuous numerical data. Perform necessary operations to
handle all continuousvalued attributes. Do not forget to show the output ie the
updated dataset after handling continuousvalued attributes. marks
Next, you will have to ensure the newly obtained dataset is optimal and free of
errors. Take appropriate actions based on the outcomes.
a Check if the dataset has any missing values. mark
b Check if the dataset has any redundant or repeated input sample. mark
c Check if the dataset has any contradicting
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
