2. Use the data set cars_kaggle.csv. a. Create 4 probit models to predict if the person...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
2. Use the data set cars_kaggle.csv. a. Create 4 probit models to predict if the person did (Purchased = 1) or did not (Purchased =0) buy a car. Use only Annual Salary (x1) as the explanatory, use only Age (x2) as the explanatory, use Annual Salary and Age, use Annual Salary, age and the interaction between Age and Annual Salary. b. List out all four models in the form of probit (ft) = a + Bx1 + Bx2... c. Fill in the following table (remember residual deviance from R is deviance) Model AIC Deviance P-value P-value Comparison Difference Pair for for comparing model to saturated Model 1 Model 2 Model 3 Model 4 comparing model to Null model (1 and 3) (2 and 3) (3 and 4) in Deviance for Nested models d. Which is the best probit model? Explain. e. Repeat parts a, b and c using logit models instead of probit models. P-value comparing models f. Which is the best logit model? g. Using the best probit model and best logit model from parts d and f, create the ROC curves. h. Using the AIC and ROC curve, which model between the best logit and best probit fits the data better? The best model to predict if a person will pass a test is a logit model that uses the hours spent on homework and the type of lecture used but not including an interaction term. This model is better than the null model (p-value = 0.003) but is still missing terms when compared to the saturated model (p- value = 0.312). However, it outperformed the other models based on difference in deviances and more area under the ROC curve. The AIC was a little bit higher than the probit model with the same explanatory variables, but they were very close, and the ROC curve was quite a bit better for the logit model. If you chose Model 1: Salary 50000 75000 100000 150000 Age 25 35 45 55 i. Summarize your results in a form like the following paragraph and fill in the predicted probabilities using the chosen model. If you chose Model 2: 50000 75000 100000 150000 Predicted Probability of a purchase If you chose Model 3 or 4: Salary Age 50000 75000 100000 150000 Predicted Probability of a purchase 35 35 35 35 45 45 45 45 Predicted Probability of a purchase 2. Use the data set cars_kaggle.csv. a. Create 4 probit models to predict if the person did (Purchased = 1) or did not (Purchased =0) buy a car. Use only Annual Salary (x1) as the explanatory, use only Age (x2) as the explanatory, use Annual Salary and Age, use Annual Salary, age and the interaction between Age and Annual Salary. b. List out all four models in the form of probit (ft) = a + Bx1 + Bx2... c. Fill in the following table (remember residual deviance from R is deviance) Model AIC Deviance P-value P-value Comparison Difference Pair for for comparing model to saturated Model 1 Model 2 Model 3 Model 4 comparing model to Null model (1 and 3) (2 and 3) (3 and 4) in Deviance for Nested models d. Which is the best probit model? Explain. e. Repeat parts a, b and c using logit models instead of probit models. P-value comparing models f. Which is the best logit model? g. Using the best probit model and best logit model from parts d and f, create the ROC curves. h. Using the AIC and ROC curve, which model between the best logit and best probit fits the data better? The best model to predict if a person will pass a test is a logit model that uses the hours spent on homework and the type of lecture used but not including an interaction term. This model is better than the null model (p-value = 0.003) but is still missing terms when compared to the saturated model (p- value = 0.312). However, it outperformed the other models based on difference in deviances and more area under the ROC curve. The AIC was a little bit higher than the probit model with the same explanatory variables, but they were very close, and the ROC curve was quite a bit better for the logit model. If you chose Model 1: Salary 50000 75000 100000 150000 Age 25 35 45 55 i. Summarize your results in a form like the following paragraph and fill in the predicted probabilities using the chosen model. If you chose Model 2: 50000 75000 100000 150000 Predicted Probability of a purchase If you chose Model 3 or 4: Salary Age 50000 75000 100000 150000 Predicted Probability of a purchase 35 35 35 35 45 45 45 45 Predicted Probability of a purchase
Expert Answer:
Related Book For
Income Tax Fundamentals 2013
ISBN: 9781285586618
31st Edition
Authors: Gerald E. Whittenburg, Martha Altus Buller, Steven L Gill
Posted Date:
Students also viewed these programming questions
-
Determine the taxable income and income tax payable for the following situation. Reference the Internal Revenue Code to justify all determinations that you make. Bill sold his primary residence on...
-
The data set Cars contains the make, model, equipment, mileage, and Kelley Blue Book suggested retail price of several used 2005 General Motor cars. Kelley Blue Book (www.kbb.com) has been an...
-
The study used for exercises in Sections 8.1 and 8.2 actually included a fifth degree of baldness, extreme. Here is a table that summarizes the full data set: What stands out about the new category...
-
Each table of values gives several points that lie on a line.(a) What is the x-intercept of the line? The y-intercept?(b) Which equation in choices AD corresponds to the given table of values?(c)...
-
Draw the logic circuit corresponding to the expression ((p q) (q r), and determine the output when the input values are p = 0, q = 0, r = 1?
-
Bethesda Mining is a midsized company. Bethesda has been approached by Mid-Ohio Electric Company with a request to supply coal for its electric generators for the next four years. Bethesda Mining...
-
A system is initially charged with \(4 \mathrm{~mol}\) ethylene and \(3 \mathrm{~mol}\) oxygen, and the following reaction occurs between the species: \[ \begin{aligned} \mathrm{C}_{2}...
-
The just completed year, Hanna Company had a net income of $35,000. Balances in the companys current asset and current liability accounts at the beginning and end of the year were: The Deferred...
-
Analyze the challenges and best practices associated with kernel upgrades and patch management in production environments. How does the kernel community ensure backward compatibility and minimize...
-
Amy Lloyd is interested in leasing a new Honda and has contacted three automobile dealers for pricing information. Each dealer offered Amy a closed-end 36-month lease with no down payment due at the...
-
Consider the dataset with 2 features fland f2 and two data points x1 and x2 and corresponding labels as y1 and y2 with real values. Consider below mentioned perceptron with weights w1 and w2 and...
-
Under what circumstances can a corporation be held legally responsible for a white-collar offense committed by one of its agents or employees?
-
To what extent have the federal courts approved drug testing of students in public institutions?
-
Why do federal criminal statutes figure so prominently in the context of whitecollar offenses?
-
How does the prosecution establish that a defendant is in constructive possession of illicit drugs?
-
What is the federal constitutional authority for enacting (a) antitrust and wire fraud laws? (b) postal offenses? (c) securities laws? (d) bankruptcy laws?
-
How does the Federal Reserve's monetary policy influence the overall level of economic activity, and what tools does it have at its disposal to achieve its policy objectives?
-
The following information is for Montreal Gloves Inc. for the year 2020: Manufacturing costs Number of gloves manufactured Beginning inventory $ 3,016,700 311,000 pairs 0 pairs Sales in 2020 were...
-
Cedar Corporation has an S corporation election in effect. During the 2012 calendar tax year, the corporation had ordinary taxable income of $200,000, and on January 15, 2012, the corporation paid...
-
Kent Pham, CPA, is a 45-year-old single taxpayer living at 169 Trendie Street, La Jolla, CA 92037. His Social Security number is 865-68-9635. In 2012, Kent's W-2 as the controller of a local...
-
Tom has a successful business with $100,000 of income in 2012. He purchases one new asset in 2012, a new machine which is 7-year MACRS property and costs $25,000. If you are Tom's tax advisor, how...
-
Determine the differential equations governing the motion of the system by using the equivalent systems method. Use the generalized coordinates shown in Figures P2.52. k FIGURE P 2.52 E x 2k www
-
Determine the differential equations governing the motion of the system by using the equivalent systems method. Use the generalized coordinates shown in Figures P2.53. 2r k ww T m FIGURE P 2.53
-
Determine the differential equations governing the motion of the system by using the equivalent systems method. Use the generalized coordinates shown in Figures P2.54. 3L - C Slender bar of mass m...
Study smarter with the SolutionInn App