Suppose you want to learn a decision tree from the simple dataset listed in the table...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Suppose you want to learn a decision tree from the simple dataset listed in the table below. There are 8 samples, each with 3 binary attributes (X1, X2, X3) and a binary class label (Y). Use the data to answer the following questions. Instance 1 X, X Y 0 0 0 0 2 0 0 1 0 3 0 1 I 6 7 8 (a) Compute the entropy of the class label. (b) Calculate the information gain for each of the three attributes. (c) Which attribute should be selected for the root of the decision tree? Why? (d) After the root node is selected, the entire tree can be learned by recursively splitting the data into two subgroups, finding the next best attribute to split on, dividing the subgroup into smaller groups, and so forth. How do you know when to stop growing the tree? In other words, what are the stopping criteria? (e) By manually running the algorithm described above, draw the resulting tree. (f) Compute the training error. (g) Now suppose you are presented new instances for which the class Y is unknown. Use your decision tree to predict the label of each instance listed below. Instance X X, X Y 9 1 1 1 ? 10 1 0 0 ? 11 0 1 1 ? (h) Do you have any basis on which to evaluate if the tree is overfitting? Why or why not? How might you combat overfitting in a decision tree? Suppose you want to learn a decision tree from the simple dataset listed in the table below. There are 8 samples, each with 3 binary attributes (X1, X2, X3) and a binary class label (Y). Use the data to answer the following questions. Instance 1 X, X Y 0 0 0 0 2 0 0 1 0 3 0 1 I 6 7 8 (a) Compute the entropy of the class label. (b) Calculate the information gain for each of the three attributes. (c) Which attribute should be selected for the root of the decision tree? Why? (d) After the root node is selected, the entire tree can be learned by recursively splitting the data into two subgroups, finding the next best attribute to split on, dividing the subgroup into smaller groups, and so forth. How do you know when to stop growing the tree? In other words, what are the stopping criteria? (e) By manually running the algorithm described above, draw the resulting tree. (f) Compute the training error. (g) Now suppose you are presented new instances for which the class Y is unknown. Use your decision tree to predict the label of each instance listed below. Instance X X, X Y 9 1 1 1 ? 10 1 0 0 ? 11 0 1 1 ? (h) Do you have any basis on which to evaluate if the tree is overfitting? Why or why not? How might you combat overfitting in a decision tree?
Expert Answer:
Answer rating: 100% (QA)
To answer the questions regarding the decision tree based on the provided dataset lets go through each question step by step a To compute the entropy of the class label we need to calculate the entrop... View the full answer
Related Book For
Posted Date:
Students also viewed these mathematics questions
-
CANMNMM January of this year. (a) Each item will be held in a record. Describe all the data structures that must refer to these records to implement the required functionality. Describe all the...
-
Portray in words what transforms you would have to make to your execution to some degree (a) to accomplish this and remark on the benefits and detriments of this thought.You are approached to compose...
-
Christine has three cars that must be overhauled by her ace mechanic, Megan. Given the following data about the cars, use least slack per remaining operation to determine Megans scheduling priority...
-
Repeat problem 11-3-29 with a completely different pair of gases. Can you come up with a generalized mixing criterion that maximizes entropy generation per unit mass or mole? Problem 11-3-29...
-
In a mathematical universe and in the time t = to, a disk starts rotating with a constant angular velocity of Wdisk =+ b + ck, over which there is a randomly-located particle P with the position...
-
What is the function of an administrative agency?
-
Tell whether each of the following accounts is a current asset; an investment; property, plant, and equipment; in intangible asset; a current liability; a long-term liability; owners equity; or not...
-
To add k N-bit words, you need k-1 N-bit adder. For example, to add 0001 + 0111 + 1101 + 0010 = 10111 you need a structure similar the one shown below (a). Classical full adder sums 3 inputs to...
-
A The following information is available for Brooks Manufacturing: Finished goods inventory was 10,000 units at the beginning of the year and 8,000 units at the end of the year. The total variable...
-
This second discussion board asks you to consider the diffeences between the two traditional ways that business leaders thought about the futures of their companies: * The Industrial/Organization...
-
LF (Life is Fine) is a company that produces smartphones. They are interested in launching a promotional campaign which will cost them $100,000. Their revenue before the campaign was $400,000. After...
-
If you were in charge of securing and retaining sponsors for a company, event or venue, what key strategies would you employ to ensure your sponsors feel valued and appreciated?
-
In this project, you will demonstrate your mastery of the following competency: Describe the purpose and function of financial management in an organization Scenario You've been an entry-level...
-
Write down an equation representing the liquidity premium theory of the term structure of interest rates. Based on this theory, explain how the yields on short-term and medium-term government bonds...
-
Write a paper about the difficulties that students face in learning English because of the education system in Saudi Arabia.
-
The electric field due to a line charge is given by where l is a constant. Show that E is solenoidal. Show that it is also conservative. E =
-
Identify each of the following as a consumer product or a business product, or classify it as both: a. frozen yogurt b. iPad c. gasoline d. boat trailer e. hand sanitizer f. Post-its
-
What are the steps in developing a marketing strategy?
-
What is the difference between primary data and secondary data?
Study smarter with the SolutionInn App