Below is a dataset of the 2201 passengers and crew aboard the RMS Titanic, which disastrously...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Below is a dataset of the 2201 passengers and crew aboard the RMS Titanic, which disastrously sunk on April 15th, 1912. For every combination of three variables (Class, Gender, Age), we have the counts of how many people survived and did not. We've also included rollups on individual variables for convenience. Class Gender Age Survived Total No Yes 1st Male Child 0 5 5 1t Male Adult 118 57 175 1* Female Child 0 1 1st Female Adult 4 140 144 Lower Male Child 35 24 59 Age Survived Total Lower Male Adult 1211 281 1492 No Yes Lower Female Child 17 27 44 Child 52 57 109 Lower Female Adult 105 176 281 Adult 1438 654 2092 Class Survived Total Gender Survived Total No Yes No Yes 122 203 325 Male 1364 367 1731 Lower 1368 508 1876 Female 126 344 470 We are interested in predicting the outcome variable Y, survival, as a function of a) the input features Class (C), Gender (G) and Age (A). Use the Gini impurity criterion to choose which of the three features C, G or A to use at the root of the decision tree. In fact, your task here is to learn a depth 1 decision tree that uses only this root feature to classify the data (decision stumps). Please show all work, including Gini impurity and overall cost function calculations for each candidate feature. b) training data? What is the accuracy rate of your decision stump (depth 1 decision tree) on the Below is a dataset of the 2201 passengers and crew aboard the RMS Titanic, which disastrously sunk on April 15th, 1912. For every combination of three variables (Class, Gender, Age), we have the counts of how many people survived and did not. We've also included rollups on individual variables for convenience. Class Gender Age Survived Total No Yes 1st Male Child 0 5 5 1t Male Adult 118 57 175 1* Female Child 0 1 1st Female Adult 4 140 144 Lower Male Child 35 24 59 Age Survived Total Lower Male Adult 1211 281 1492 No Yes Lower Female Child 17 27 44 Child 52 57 109 Lower Female Adult 105 176 281 Adult 1438 654 2092 Class Survived Total Gender Survived Total No Yes No Yes 122 203 325 Male 1364 367 1731 Lower 1368 508 1876 Female 126 344 470 We are interested in predicting the outcome variable Y, survival, as a function of a) the input features Class (C), Gender (G) and Age (A). Use the Gini impurity criterion to choose which of the three features C, G or A to use at the root of the decision tree. In fact, your task here is to learn a depth 1 decision tree that uses only this root feature to classify the data (decision stumps). Please show all work, including Gini impurity and overall cost function calculations for each candidate feature. b) training data? What is the accuracy rate of your decision stump (depth 1 decision tree) on the
Expert Answer:
Answer rating: 100% (QA)
2a For gender HY G pM ale Y es log pY esM ale pM ale No log pNoM ale pF emale Y es ... View the full answer
Related Book For
Business Statistics
ISBN: 978-0321925831
3rd edition
Authors: Norean Sharpe, Richard Veaux, Paul Velleman
Posted Date:
Students also viewed these algorithms questions
-
People in real estate are interested in predicting the price of a house by the square footage, and predictions will vary based on geographic area. We look at predicting prices (in $1000s) of houses...
-
Suppose you are interested in predicting the abortion rate in the USA based on various factors in the excel file Abortion.xls The definition of the variables of interest is presented in the table...
-
The 2223 people aboard the Titanic include 361 male survivors, 1395 males who died, 345 female survivors, and 122 females who died. Use the given categorical data to construct the relative frequency...
-
Using a resource-based view, explain why some firms improve their economic performance by adopting a CSR strategy, whereas others achieve no results or damaging results.
-
What is the difference between an entity's principal market and its most advantageous market?
-
P4-2 Summer Corp. owns all the shares of Keira Ltd. The shares were acquired on July 1, 2011, by Summer at a cost of $60,000. At acquisition date, the capital of Keira consisted of 44,000 common...
-
Consider a 3 -year \(10 \%\) coupon bond. The underlying short rate of interest follows a lattice with initial value of \(R=1.15\) and then has an factor of 1.02 , a down factor of .99 , and...
-
In Section 5.5, we showed the following two-person, zero-sum game had a mixed strategy: a. Use dominance to reduce the game to a 2 2 game. Which strategies are dominated? b. Determine the optimal...
-
Starbucks (SBUX) Income Statement for 2018 contained the following financial data: Revenue Cost of Revenue Operating Expenses Interest Expense $22,387,000 $ 9,038,000 $ 9,452,000 $ 93,000 Using the...
-
1) The ACFE Foundation works to increase the body of anti-fraud knowledge by supporting students who wish to pursue careers in fraud examination and other fraud-related fields. Explain how your...
-
You and your lifelong friend are partners together in the promotional materials business. That is, when marketing firms and their clients begin advertising or public relations campaigns, they come to...
-
Explain how to calculate a business' return on total assets and what it is used for.
-
Explain how to calculate the debt ratio and what it is used for.
-
What is the accounting equation, and how does it relate to the balance sheet of a business?
-
In each of the following situations, the total increase or decrease for one component of the accounting equation is missing. i Assets increased by \(\$ 20800\); liabilities increased by \(\$ 6400\)...
-
What is meant by the term financial flexibility, and why is it important?
-
Management is considering using activity-based costing to assign manufacturing overhead cost to products. The activity-based costing system would have the following four activity cost pools: Expected...
-
What are the key dimensions of critical thinking 2. Watch the NBC Learn video on Diet Scams. What types of claims are made in this video Are they valid Elaborate on your responses. Discuss this video...
-
In Exercise you investigated the federal rate on 3-month Treasury bills between 1950 and 1980. The scatterplot below shows that the trend changed dramatically after 1980, so weve built a new...
-
Indicate whether each statement below is true or false. If false, explain why. a) Asking viewers to call into an 800 number is a good way to produce a representative sample. b) When writing a survey,...
-
AaronsAir (Exercises) could purchase a market survey from a firm that has advised the island tourist and conference bureau. He thinks their projections would help him determine whether the...
-
If you stood atop a super-tall ladder three times as far from Earths center as at Earths surface, how would your weight compare with it present value?
-
How was Pioneer 10 able to escape the solar system with an initial speed less than escape speed?
-
With no gravity, a horizontally moving projectile follows a straight-line path. With gravity, how far below the straightline path does it fall compared with the distance of free fall?
Study smarter with the SolutionInn App