Below is a dataset of the 2201 passengers and crew aboard the RMS Titanic, which disastrously...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Below is a dataset of the 2201 passengers and crew aboard the RMS Titanic, which disastrously sunk on April 15th, 1912. For every combination of three variables (Class, Gender, Age), we have the counts of how many people survived and did not. We've also included rollups on individual variables for convenience. Class Gender Age Survived Total No Yes 1st Male Child 0 5 5 1t Male Adult 118 57 175 1* Female Child 0 1 1st Female Adult 4 140 144 Lower Male Child 35 24 59 Age Survived Total Lower Male Adult 1211 281 1492 No Yes Lower Female Child 17 27 44 Child 52 57 109 Lower Female Adult 105 176 281 Adult 1438 654 2092 Class Survived Total Gender Survived Total No Yes No Yes 122 203 325 Male 1364 367 1731 Lower 1368 508 1876 Female 126 344 470 We are interested in predicting the outcome variable Y, survival, as a function of a) the input features Class (C), Gender (G) and Age (A). Use the Gini impurity criterion to choose which of the three features C, G or A to use at the root of the decision tree. In fact, your task here is to learn a depth 1 decision tree that uses only this root feature to classify the data (decision stumps). Please show all work, including Gini impurity and overall cost function calculations for each candidate feature. b) training data? What is the accuracy rate of your decision stump (depth 1 decision tree) on the Below is a dataset of the 2201 passengers and crew aboard the RMS Titanic, which disastrously sunk on April 15th, 1912. For every combination of three variables (Class, Gender, Age), we have the counts of how many people survived and did not. We've also included rollups on individual variables for convenience. Class Gender Age Survived Total No Yes 1st Male Child 0 5 5 1t Male Adult 118 57 175 1* Female Child 0 1 1st Female Adult 4 140 144 Lower Male Child 35 24 59 Age Survived Total Lower Male Adult 1211 281 1492 No Yes Lower Female Child 17 27 44 Child 52 57 109 Lower Female Adult 105 176 281 Adult 1438 654 2092 Class Survived Total Gender Survived Total No Yes No Yes 122 203 325 Male 1364 367 1731 Lower 1368 508 1876 Female 126 344 470 We are interested in predicting the outcome variable Y, survival, as a function of a) the input features Class (C), Gender (G) and Age (A). Use the Gini impurity criterion to choose which of the three features C, G or A to use at the root of the decision tree. In fact, your task here is to learn a depth 1 decision tree that uses only this root feature to classify the data (decision stumps). Please show all work, including Gini impurity and overall cost function calculations for each candidate feature. b) training data? What is the accuracy rate of your decision stump (depth 1 decision tree) on the
Expert Answer:
Answer rating: 100% (QA)
2a For gender HY G pM ale Y es log pY esM ale pM ale No log pNoM ale pF emale Y es ... View the full answer
Related Book For
Business Statistics
ISBN: 978-0321925831
3rd edition
Authors: Norean Sharpe, Richard Veaux, Paul Velleman
Posted Date:
Students also viewed these algorithms questions
-
People in real estate are interested in predicting the price of a house by the square footage, and predictions will vary based on geographic area. We look at predicting prices (in $1000s) of houses...
-
Suppose you are interested in predicting the abortion rate in the USA based on various factors in the excel file Abortion.xls The definition of the variables of interest is presented in the table...
-
The 2223 people aboard the Titanic include 361 male survivors, 1395 males who died, 345 female survivors, and 122 females who died. Use the given categorical data to construct the relative frequency...
-
Using a resource-based view, explain why some firms improve their economic performance by adopting a CSR strategy, whereas others achieve no results or damaging results.
-
Look at the illustrative new-issue prospectus in the appendix. a. Is this issue a primary offering, a secondary offering, or both? b. What are the direct costs of the issue as a percentage of the...
-
The volume of a rectangular parallelepiped is given by V= xyz. If Use this result to determine the bilateral tolerance on the volume of a rectangular parallelepiped with dimensions a = 1.500 0.002...
-
If we were to use such data and conclude that there is a correlation or association between IQ score and brain volume, does it follow that larger brains are the cause of higher IQ scores? IQ Score...
-
Missouri Fidelity Union Trust Life Insurance Company stock was trading at $2.63 per share. Eight directors sold their shares for $7.00 per share, conditioned on the resignation of eleven of the...
-
If Abigail commences the proceeding in Brampton but wishes to have it tried in Toronto, what does she do? Fiona Flapdoodle must prepare the defendant's statement of defense. She wants to know what...
-
Laurman, Inc. is considering a new project and has provided the details of the project. The Controller has asked you to compute various capital budgeting methods to help aid in the decision to pursue...
-
7.2. Consider the waveform f(t) in Figure P7.2. (a) Write a mathematical expression for f(1). (b) Find the Laplace transform for this waveform, using (7.4) and the table of integrals in Appendix A....
-
The business of B Walnut purchased some non-current assets during the year ended 30 June 2022. A photocopier (office equipment) was purchased on 31 July 2021 for $4147 ($3770 + $377 GST) and is to be...
-
H Bilberry purchased a machine on 31 July 2021, to be depreciated by the straight line method at 15% p.a., which cost $26 312 ($23 920 + $2392 GST). Installation and commissioning was completed on 31...
-
Prepare the accounts receivable and accounts payable control accounts of J Wright for November 2022. Accounts payable control balance 31 October 2022 Accounts receivable control balance 31 October...
-
A business involved with long-distance transport bought a new truck designed for haulage between all capital cities throughout the country. The motor vehicle was purchased on credit on 31 July 2021...
-
Plant and equipment was purchased on credit on 1 September 2021 for $85 415 ($77 650 + $7765 GST). Installation was finalised and commissioned on 31 October 2021 at a cost of $18 821 ($17 110 + $1711...
-
The Loan Processor sends a VOE to ABC Corporation as part of Jonas's loan process. What information will the document contain when it is returned?
-
What are the key dimensions of critical thinking 2. Watch the NBC Learn video on Diet Scams. What types of claims are made in this video Are they valid Elaborate on your responses. Discuss this video...
-
In Exercise you investigated the federal rate on 3-month Treasury bills between 1950 and 1980. The scatterplot below shows that the trend changed dramatically after 1980, so weve built a new...
-
Indicate whether each statement below is true or false. If false, explain why. a) Asking viewers to call into an 800 number is a good way to produce a representative sample. b) When writing a survey,...
-
AaronsAir (Exercises) could purchase a market survey from a firm that has advised the island tourist and conference bureau. He thinks their projections would help him determine whether the...
-
An analysis of the accounts of Beautiful Bottles Pty Ltd reveals the following manufacturing cost data for the month ended 30 June 2019. Required (a) Prepare the cost of goods manufactured schedule...
-
The following accounts and amounts (balances are normal balances) were taken from the records of Prider Manufacturers Ltd at 30 June 2019. Required (a) Prepare a cost of goods manufactured statement...
-
The following data were taken from the records of Manik Manufacturing Ltd for the year ended 30 June 2019. Required (a) Prepare the cost of goods manufactured schedule for the year ended 30 June...
Study smarter with the SolutionInn App