3. [Decision Trees] You own a movie theater and are trying to understand your market: what...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
3. [Decision Trees] You own a movie theater and are trying to understand your market: what types of people frequently go to the movies? You start with the following dataset with data about 6 people with different age groups, income levels, and professions, and whether or not they frequently go to movie theaters. In particular, you are going to build a decision tree to predict whether or not someone is a frequent movie-goer. No High Income? Engineer? Movie Goer? T F T T 1 2 3 4 T F T F Yes Yes Yes No Recall the following definitions of entropy and information gain, respectively, which are useful for this problem: H(Z) == P(Y = y) log P(Y = y) Y IG(Z, j,t) = H (Z) H(Z[x; = t])P(x; = t) - H(Z[x; t])P(xj t). a. (4 pts) Based on the principle of information gain, which attribute is to be used for the first split? Be sure to show your computations. You can round the entropy and information gain values to two decimal places. b. (4 pts) Draw the complete (unpruned) decision tree, showing the class predictions at the leaves. Assuming you are using LaTeX, you may (i) very neatly hand draw the tree, photograph it, and include it as a figure, (ii) draw it using a graph- ics program or PowerPoint, or (iii) express the tree in a series of if statements, preferably using LaTeX's verbatim environment. c. (2 pts) From the Decision Tree constructed in the previous question, predict whether a person who has high income but is not an engineer is a movie goer. 3. [Decision Trees] You own a movie theater and are trying to understand your market: what types of people frequently go to the movies? You start with the following dataset with data about 6 people with different age groups, income levels, and professions, and whether or not they frequently go to movie theaters. In particular, you are going to build a decision tree to predict whether or not someone is a frequent movie-goer. No High Income? Engineer? Movie Goer? T F T T 1 2 3 4 T F T F Yes Yes Yes No Recall the following definitions of entropy and information gain, respectively, which are useful for this problem: H(Z) == P(Y = y) log P(Y = y) Y IG(Z, j,t) = H (Z) H(Z[x; = t])P(x; = t) - H(Z[x; t])P(xj t). a. (4 pts) Based on the principle of information gain, which attribute is to be used for the first split? Be sure to show your computations. You can round the entropy and information gain values to two decimal places. b. (4 pts) Draw the complete (unpruned) decision tree, showing the class predictions at the leaves. Assuming you are using LaTeX, you may (i) very neatly hand draw the tree, photograph it, and include it as a figure, (ii) draw it using a graph- ics program or PowerPoint, or (iii) express the tree in a series of if statements, preferably using LaTeX's verbatim environment. c. (2 pts) From the Decision Tree constructed in the previous question, predict whether a person who has high income but is not an engineer is a movie goer.
Expert Answer:
Answer rating: 100% (QA)
a To determine the attribute to be used for the first split we need to calculate the information gai... View the full answer
Related Book For
Business Intelligence And Analytics Systems For Decision Support
ISBN: 9781292009209
10th Global Edition
Authors: Efraim Turban, Ramesh Sharda, Dursun Delen, Pearson Education Limited, Dennis G. Zill
Posted Date:
Students also viewed these programming questions
-
If a trespass to property permanently deprives the property holder of the use of their property, what tort has been committed?
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
Prosci's change management methodology is developed based on research with over 3,400 participants over the last twenty years. What is unique about the methodology is that it comes from real project...
-
You are the assistant vice president in charge of production for a firm that produces computers. Your firm's production function is f(L,K) = min (L,K) Where L and K are the quantities of the two...
-
Which of the following molecules may show a pure rotational microwave absorption spectrum? (a) H20, (b) H202, (c) NH4 (d) N20?
-
For the fixed-bias configuration of Fig. 5.167: a. Determine Av.NL, Zi and Z0. b. Sketch the two-port model of Fig. 5.63 with the parameters determined in part (a) in place. c. Calculate the gain AvL...
-
A trout jumps, producing waves on the surface of a 0.8 -m-deep mountain stream. If it is observed that the waves do not travel upstream, what is the minimum velocity of the current?
-
Explain why, if two mutually exclusive projects are being compared, the short-term project might have the higher ranking under the NPV criterion if the cost of capital is high, but the long-term...
-
Roberta Bondar is travelling in her space craft around a distant planet. The planet has a radius R' and she is at an altitude of 2R' above its surface, coasting at some constant speed in a circular...
-
In Integrative Case 10.1, we projected financial statements for Walmart Stores for Years +1 through +5. The data in Chapter 12, Exhibits 12.17 through 12.19 include the actual amounts for 2015 and...
-
Event Management question Please organize a professional event in the field of business (i.e: a conference, summit, seminar, festival, concert etc) by closely following / applying the essential steps...
-
Healthcare professionals have a duty to maintain the confidentiality of patients. Describe what this jeans in Val won Words.
-
Read the following and answer the question - " To Be Or Not To Be " In "The Belly Of The Whale"; A Reading Of Joseph Campbell's "Modern Hero" Hypothesis In Hamlet On Film." After reading your chosen...
-
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623732/pdf/nutrients-13-03884.pdf Please read and critique this article answering the following questions in paragraph form: 1. What is the hypothesis...
-
C++ Improper Function Definitions in Header Files Explanation: Ensure that the function definitions in header files (e.g., subexpression.h, operand.h, etc.) are correctly defined. How would I...
-
Palomar Engineering (PEINC) Stock is trading at $100 per share today. Expecting the share price to drop, you sell short 3 shares of stock. If you close this trade in 2 weeks when the price/share is...
-
What role does each of the cells below play in the maintenance of the alveolus?
-
Why do markets typically lead to an efficient outcome for buyers and sellers?
-
Examine the difficulties to implement a new DSS over legacy systems.
-
List the different problem-solving search methods.
-
What is an ES?
-
For a 2000-line encoder sampled at 5 kHz, what is the speed resolution absent filtering?
-
a. What are the two advantages of resolvers over encoders? b. What are the two advantages of encoders over resolvers?
-
What servo drive configuration is the least intelligent? And the most intelligent?
Study smarter with the SolutionInn App