The Advertising data set is provided to you. Here is a scenario. Suppose that you are...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
The Advertising data set is provided to you. Here is a scenario. Suppose that you are a statistical consultant hired by a client to investigate the association between advertising and sales of a particular product. The Advertising data set consists of the sales of that product in 200 different markets, along with advertising budgets for the product in each of those markets for three different media: TV, radio, and newspaper. It is not possible for our client to directly increase sales of the product. On the other hand, they can control the advertising expenditure in each of the three media. Therefore, if we determine that there is an association between advertising and sales, then we can instruct our client to adjust advertising budgets, thereby indirectly increasing sales. In other words, our goal is to develop an accurate model that can be used to predict sales on the basis of the three media budgets. In this setting, the advertising budgets are input variables or predictors while sales is an output variable or response. We will now try to predict sales using the other variables in this data set. In the Advertising data set, sales is measured in thousands of units, and TV, radio, and newspaper budgets, are measured in thousands of dollars. (i) Split the data with 80 percent of data into training and 20 percent to the test set. Then scale the X train and X test data using Standard Scalar. (ii) Fit a LASSO regression on the training data using sales as the target variable and all other variables as the predictors using alpha = 2. Are all the predictors statistically significant? Predict on the test dataset and report the RMSE and MAPE. (2 points) (iii) Write the LASSO regression equation. Predict the sales for the new data point in which TV = 137, radio = 10, newspaper = 14. (2 points) (iv) Find the optimal value of alpha for LASSO regression using grid search cross-validation using all training datasets. What is the best alpha value? (2 points) (v) Fit a LASSO regression on the training data using the best alpha chosen, with sales as the target variable and all other variables as the predictors. Predict on the test dataset and report the RMSE and MAPE. Did you see the improvement in the RMSE and MAPE scores? (2 points) (vi) Write the LASSO regression equation. Predict the sales for the new data point in which TV = 137, radio = 10, newspaper = 14. (2 points) (vii) (Bonus) Create appropriate interactive visualization for the lasso regression residuals that provides new insights to your results. (2 points) The Advertising data set is provided to you. Here is a scenario. Suppose that you are a statistical consultant hired by a client to investigate the association between advertising and sales of a particular product. The Advertising data set consists of the sales of that product in 200 different markets, along with advertising budgets for the product in each of those markets for three different media: TV, radio, and newspaper. It is not possible for our client to directly increase sales of the product. On the other hand, they can control the advertising expenditure in each of the three media. Therefore, if we determine that there is an association between advertising and sales, then we can instruct our client to adjust advertising budgets, thereby indirectly increasing sales. In other words, our goal is to develop an accurate model that can be used to predict sales on the basis of the three media budgets. In this setting, the advertising budgets are input variables or predictors while sales is an output variable or response. We will now try to predict sales using the other variables in this data set. In the Advertising data set, sales is measured in thousands of units, and TV, radio, and newspaper budgets, are measured in thousands of dollars. (i) Split the data with 80 percent of data into training and 20 percent to the test set. Then scale the X train and X test data using Standard Scalar. (ii) Fit a LASSO regression on the training data using sales as the target variable and all other variables as the predictors using alpha = 2. Are all the predictors statistically significant? Predict on the test dataset and report the RMSE and MAPE. (2 points) (iii) Write the LASSO regression equation. Predict the sales for the new data point in which TV = 137, radio = 10, newspaper = 14. (2 points) (iv) Find the optimal value of alpha for LASSO regression using grid search cross-validation using all training datasets. What is the best alpha value? (2 points) (v) Fit a LASSO regression on the training data using the best alpha chosen, with sales as the target variable and all other variables as the predictors. Predict on the test dataset and report the RMSE and MAPE. Did you see the improvement in the RMSE and MAPE scores? (2 points) (vi) Write the LASSO regression equation. Predict the sales for the new data point in which TV = 137, radio = 10, newspaper = 14. (2 points) (vii) (Bonus) Create appropriate interactive visualization for the lasso regression residuals that provides new insights to your results. (2 points)
Expert Answer:
Related Book For
Introduction To Probability And Statistics
ISBN: 9781133103752
14th Edition
Authors: William Mendenhall, Robert Beaver, Barbara Beaver
Posted Date:
Students also viewed these databases questions
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...
-
Suppose that the government of Malud increases both its own spending and autonomous taxes by $300 and the economys multiplier equals 2.5. If consumers spend 95% of their disposable (after-tax)...
-
Repeat Exercise 2 using the Midpoint rule. In Exercise 2 0,25 (cos x)2 dx -0.25 .0 x In(x 1) dx -0.5 ((sin x) 2xsn) dx 0.7S dx xInx
-
Mr A has been working for several years as a senior manager in France for the company AVIRON. A year ago, his employer offered him to take over the technical management of its subsidiary AVIRON ON...
-
The English scores for a sample of 14 students Find the mean, the median, and the mode of the data, if possible. If any measure cannot be found or does not represent the center of the data, explain...
-
A gas turbine power plant receives a shipment of hydrocarbon fuel whose composition is uncertain but may be represented by the expression CxHy. The fuel is burned with excess air. An analysis of the...
-
Discuss three changes written into the Constitution in 1789 that changed the government from what it was under the ArticIes of Confederation?
-
Problem H03-03 [Points: 1/5]. From the lessons learned inProblem H03-02, create a Python snippet, h0303.py, that performsthe following: (a) [Points 0.6/5] Compute the nth Fibonacci numberfor n>3...
-
Auditors do not test every document or transaction. How do they gain reasonable assurance through sampling and testing?
-
Consider economies in which individuals differ both in their wage and their preferences. Assume we want to compensate them for their wage, while staying neutral with respect to their preferences. In...
-
Describe any opportunities you have had to conduct internal or external networking. What have you learned from your networking experiences that would benefit your colleagues, as well as you in future...
-
University researchers create a positive externality because what they discover in their research laboratories can be easily learned by others who have not contributed to the researchers' costs. What...
-
Whats the most popular product type (hierarchy 1) sold in all stores over a week? How much revenue did the stores receive for that product during the week? How does that compare with the second most...
-
You need to do ratio analysis of Starbucks after reviewing its financial statement 2017: Operating Expenses" Net earnings Adjustment to concile net income to net cash: Depreciation and amortanization...
-
The Pletcher Transportation Company uses a responsibility reporting system to measure the performance of its three investment centers: Planes, Taxis, and Limos. Segment performance is measured using...
-
Consider a process with the following streams: When \(\Delta T_{\text {min }}=10^{\circ} \mathrm{C}\), the minimum utilities for heating and cooling are \(237 \mathrm{~kW}\) and \(145 \mathrm{~kW}\),...
-
Figure 11.64 presents the PFD of a process for the production of organic fibers (Smith, 2005). In the process, the organic solvent is removed from the fibers in dryer D-100 using circulated warmed...
-
Consider a process with the following streams: (a) Determine MER targets for \(\Delta T_{\text {min }}=10^{\circ} \mathrm{C}\). (b) Design a HEN for MER using no more than 10 heat exchangers...
Study smarter with the SolutionInn App