Consider the training examples shown in the following table for binary classification (class labels are mammals...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Consider the training examples shown in the following table for binary classification (class labels are mammals and non-mammals): Give Birth yes no no yes no no yes no yes yes no no yes no no no no no yes no Can Fly no no no no no no yes yes no no no no no no no no no yes no Live in Water no no yes yes sometimes no no no no yes sometimes sometimes no yes sometimes no no no yes no Have Legs yes no no no yes yes yes yes yes no yes yes yes no yes yes yes yes no Class mammals non-mammals non-mammals mammals non-mammals non-mammals mammals non-mammals mammals non-mammals non-mammals non-mammals mammals non-mammals non-mammals non-mammals mammals non-mammals mammals non-mammals yes yes Table credit: Introduction to Data Mining (2nd Edition), by Tan, Steinbach, Karpatne, Kumar a. (15%) Compute the misclassification error rate for the entire training set and each attribute (i.e., Give Birth, Can Fly, Live in Water, Have Legs) b. (5%) Using misclassification error rate, which attribute is better to be used for splitting (to build a decision tree) and why? c. (15%) Compute the Gini index for the entire training set and each attribute (i.e., Give Birth, Can Fly, Live in Water, Have Legs) d. (5%) Using Gini index, which attribute is better to be used for splitting (to build a decision tree) and why? e. (40%) Build a decision tree with a depth of no more than 2 using the greedy approach and the Gini index as the splitting criterion. f. (20%) Based on the decision tree built in part (e), compute the confusion matrix, accuracy, precision, recall, and F1 for the training set. Consider the training examples shown in the following table for binary classification (class labels are mammals and non-mammals): Give Birth yes no no yes no no yes no yes yes no no yes no no no no no yes no Can Fly no no no no no no yes yes no no no no no no no no no yes no Live in Water no no yes yes sometimes no no no no yes sometimes sometimes no yes sometimes no no no yes no Have Legs yes no no no yes yes yes yes yes no yes yes yes no yes yes yes yes no Class mammals non-mammals non-mammals mammals non-mammals non-mammals mammals non-mammals mammals non-mammals non-mammals non-mammals mammals non-mammals non-mammals non-mammals mammals non-mammals mammals non-mammals yes yes Table credit: Introduction to Data Mining (2nd Edition), by Tan, Steinbach, Karpatne, Kumar a. (15%) Compute the misclassification error rate for the entire training set and each attribute (i.e., Give Birth, Can Fly, Live in Water, Have Legs) b. (5%) Using misclassification error rate, which attribute is better to be used for splitting (to build a decision tree) and why? c. (15%) Compute the Gini index for the entire training set and each attribute (i.e., Give Birth, Can Fly, Live in Water, Have Legs) d. (5%) Using Gini index, which attribute is better to be used for splitting (to build a decision tree) and why? e. (40%) Build a decision tree with a depth of no more than 2 using the greedy approach and the Gini index as the splitting criterion. f. (20%) Based on the decision tree built in part (e), compute the confusion matrix, accuracy, precision, recall, and F1 for the training set.
Expert Answer:
Answer rating: 100% (QA)
To solve this problem well follow these steps a Compute the misclassification error rate for the entire training set and each attribute b Determine wh... View the full answer
Related Book For
Introduction to Data Mining
ISBN: 978-0321321367
1st edition
Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar
Posted Date:
Students also viewed these programming questions
-
An electronic circuit is created to light a lamp. There are two buttons (named A and B) in the circuit, each of which can create a "short-to-ground" that will result in the lamp failing to light when...
-
Consider the training examples shown in Table 4.1 for a binary classification problem. a) Compute the Gini index for the overall collection of training examples. (b) Compute the Gini index for the...
-
Consider the training examples shown in Table 4.2 for a binary classification problem. (a) What is the entropy of this collection of training examples with respect to the positive class? (b) What are...
-
The surveyor's formula (also called the Shoelace formula or Gauss's area formula) is a handy tool for computing the area of polygonal regions in the plane. For a triangle, it says the following:...
-
Knowing that p is proportional to L, rescale the data of Example 5.7 to plot dimensionless p versus dimensionless viscosity. Use this plot to find the viscosity required in the first row of data in...
-
What would be the marginal and average tax rates for a corporation with an income level of $ 100,000?
-
Write a short note on : Types of belt drive.11
-
Part of your company's accounting database was destroyed when Godzilla attacked the city. You been able to gather the following data from your files. Reconstruct the remaining information. Using'"...
-
A 5.0 g bullet moving 325 m/s is shot into a 1.25 kg block, which slides for 1.35 seconds across the surface it is on before coming to rest. What is the average force of kinetic friction between the...
-
A US motivational speaker was brought in by a South African private company as part of its annual general meeting to speak to the sales and marketing staff. Part of the presentation included...
-
In 2012, Nancy invested $25,000 into Relevance, a search- based marketing firm as part of a $300,000 family and friends round. She received a convertible note with a $2,000,000 cap and a 20%...
-
Dr. & Mrs. Vanderlay's Retirement Investment Objectives Provide $50,000 of withdrawals from the investment account each year. This amount will be in addition to the university pension and Social...
-
First , explain what Social Exchange Theory is and how it functions in Interpersonal Relationships. Second , describe a situation (real or hypothetical) where Social Exchange Theory is a factor in a...
-
A block sits on a slope that has an angle of 10. to the horizontal. There is a coefficient of static friction us = 0.3 between the block and the slope. You apply a force to the block in an attempt to...
-
Complete the following table. Functional Group Name Sample Functional Group R-OH R R' R Naming Rule) name the chain that came from the alcohol first name the carboxylic acid second and end with...
-
I am running for Vice President of Legislative Affairs at my University as part of the Student Government Association. I am need of ideas!!! Keep this in mind. My platform is centered around pushing...
-
On 2017-02-01, P purchased 27% of the outstanding shares of TORT common stock for $65,195 . P accounted for the investment using the equity method . On 2017-07-15, P received a cash dividend of...
-
Select the correct answer for each of the following questions. 1. On December 31, 20X3, Saxe Corporation was merged into Poe Corporation. In the business combination, Poe issued 200,000 shares of its...
-
Distinguish between noise and outliers. Be sure to consider the following questions. (a) Is noise ever interesting or desirable? Outliers? (b) Can noise objects be outliers? (c) Are noise objects...
-
Hierarchical clustering algorithms require O(m2 log(m)) time, and consequently, are impractical to use directly on larger data sets. One possible technique for reducing the time required is to sample...
-
Draw all the candidate subgraphs obtained by joining the pair of graphs shown in Figure 7.4. Assume the edge-growing method is used to expand the subgraphs.
-
Assume the same facts as in Problem 42. Further assume that next year, Gary sells the SUV for $20,000. a. How much depreciation expense can Gary deduct as a business deduction with respect to the SUV...
-
Describe how liabilities are reported and analyzed.
-
Assume the same facts as in Problem 29, except that Sally has AGI of \($75,000.\) What is her qualified student loan interest deduction in 2019? Problem 29, In 2019, Sally Morris, a single taxpayer,...
Study smarter with the SolutionInn App