Consider a dataset with three columns of binary attributes A, A and a binary label attribute...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Consider a dataset with three columns of binary attributes A₁, A₂ and a binary label attribute Y. There are eight types of data point in total, and their corresponding proportions in the dataset are captured in the column P. type 1 2 3 4 5 6 7 8 A₁ A2 Y P 0 0 0 8% 0 1 0 0 1 1 1 1 0 1 OTT 1 0 0 1 1 0|1|0| 0 1 Hol 0 1 29% 2% 18% 16% 2% 1% 24% (a) What is the GINI index of the dataset? (b) What is the GINI index of the split on A₁? (c) What is the GINI index of the split on A₂? (d) Construct a decision tree of exactly five nodes for the dataset using the Hunt's algorithm. (e) What is the accuracy of the decision tree built in (d)? Consider a dataset with three columns of binary attributes A₁, A₂ and a binary label attribute Y. There are eight types of data point in total, and their corresponding proportions in the dataset are captured in the column P. type 1 2 3 4 5 6 7 8 A₁ A2 Y P 0 0 0 8% 0 1 0 0 1 1 1 1 0 1 OTT 1 0 0 1 1 0|1|0| 0 1 Hol 0 1 29% 2% 18% 16% 2% 1% 24% (a) What is the GINI index of the dataset? (b) What is the GINI index of the split on A₁? (c) What is the GINI index of the split on A₂? (d) Construct a decision tree of exactly five nodes for the dataset using the Hunt's algorithm. (e) What is the accuracy of the decision tree built in (d)?
Expert Answer:
Answer rating: 100% (QA)
a The GINI index of the dataset can be calculated as follows GINI 1 PY02 PY12 1 05372 04632 0499 b T... View the full answer
Related Book For
Introduction to Data Mining
ISBN: 978-0321321367
1st edition
Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar
Posted Date:
Students also viewed these databases questions
-
A 1 defects size B D E 88888888888888888888888888 2 21 3 24 4 16 5 12 6 15 7 5 8 28 9 20 10 31 11 25 12 20 13 24 14 16 15 19 16 10 17 17 18 13 19 22 20 18 21 39 22 30 23 24 24 16 25 19 26 17 27 15...
-
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 A B Total assets Sales Debt-equity ratio Return on equity D $ $ Net income 2,604 5,783 0.75 E 11% F Y3K, Inc., has sales of $5,783, total assets of...
-
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 ARASIN22227 17 18 19 20 21 23 24 25 26 1) (2) Sales Price per unit Variable Cost per unit Contribution per unit Contribution Margin Ratio Break Even Point Sales...
-
There is a crop that the value p next month is random. The value can be either small p=1 or large P=3 depending on the weather. Half of people are optimists believing the value of the crop will be...
-
George Parker was paid a salary of $74,700 during 2013 by Umberger Company. In addition, during the year, Parker started his own business as a public accountant and reported a net business income of...
-
When preparing departmental contribution statements of financial performance, all direct expenses are controllable by department managers. Discuss, giving examples.
-
For the loads listed: c. What size inverter (peak watts) should she purchase? d. If the inverter is \(88 \%\) efficient, how much more daily energy is required from the \(\mathrm{PV}\) array as...
-
Franklin Paper Company manufactures newsprint. The product is manufactured in two departments, Papermaking and Converting. Pulp is first placed into a vessel at the beginning of papermaking...
-
ToyJoy! estimates that customers will be granted 2,600 in refunds of this year's sales next year and the merchandise expected to be returned will have a cost $2,000. How would I journalize the...
-
Costello Company has several divisions. The controller, Sarah James, prepares monthly segment reports for each division. Each division manager is evaluated annually, based largely on the segment...
-
the following questions with at least 200 words , and a recommended length of 250 words . This is roughly equivalent to 3 - 4 paragraphs or 1/2 page with standard font and line spacing in Microsoft...
-
Bentall Ink is a chain of tattoo parlors that follows IFRS. The following data is for Year 8: Golf club dues were $20,000. Fines for operating without the proper city zoning were $10,000. Automated...
-
Primary tax authority comes from statutory, administrative, and judicial sources. This includes, but is not limited to: Internal Revenue Code Revenue Rulings Financial Accounting Standards Board...
-
Suppose Fry's Electronics, Inc. provides $10,500 of computer support at the Dallas-Fort Worth store during the month of November. How would Fry's Electronics record this transaction? Assume all...
-
Ware Co. produces and sells motorcycle parts. On the first day of its fiscal year, Ware Co. issued $90,000,000 of five-year, 14% bonds at a market (effective) interest rate of 12%, with interest...
-
There are basically two methods of recognizing bad debt expense: (1) the direct write-off method and (2) the allowance method. (1) the direct write-off method and (2) the allowance method of...
-
Write an analogy representing the Speech Communication Process, highlighting each of the seven elements. One example is a volleyball game Brief example: "The speech communication model is like a...
-
Vince, Inc. has developed and patented a new laser disc reading device that will be marketed internationally. Which of the following factors should Vince consider in pricing the device? I. Quality of...
-
Describe the potential time complexity of anomaly detection approaches based on the following approaches: model-based using clustering, proximity-based, and density. No knowledge of specific...
-
A few months later, you are again approached by the same marketing director as in Exercise 3. This time, he has devised a better approach to measure the extent to which a customer prefers one product...
-
Show that 1 minus the Jaccard similarity is a distance measure between two data objects, x and y, that satisfies the metric axioms given on page 70. Specifically, d(x, y) = 1 J(x, y).
-
Use the Chart screen (Chart ) to generate historical prices of a selected stock and its call and put options with different expirations and expiration. Select a period in which the options were...
-
Use the Chart screen (Chart ) to generate historical prices for the S\&P 500 spot, and call and put options on the index with different expirations and expiration. Select a period in which the...
-
Suppose just prior to going ex-dividend, XYZ stock is trading at \(\$ 65\) and is expected to go ex-dividend with a dividend expected to be worth \(\$2.50\) on the ex-dividend date. What advice would...
Study smarter with the SolutionInn App