Develop a new custom category variable based on 'Total Crash Injuries' variable. This new custom category should
Fantastic news! We've Found the answer you've been seeking!
Question:
Develop a new custom category variable based on 'Total Crash Injuries' variable. This new custom category should contain two categories only. One category is injuries equal to zero, while the other category is for crashes with one or more injuries. Looking for screenshot of SAS Viya step for this question !!!
Transcribed Image Text:
Provide your answers to each question, including relevant figures (e.g. SAS Viya outputs) in a word/pdf document. You must answer all questions. Note the word limit of your answer script is 2500 words. Task 1: Exploratory Data Analysis (25 points) Using 'FACILITY_TOY' dataset available in SAS Viya, answer the following questions. Export/copy all charts you create into your answer script, describe the charts (e.g. what your chart is visualising in each axis) and interpret the charts, i.e. what your chart highlights. This interpretation is ideally accessible to even a non-technical reader. Q1. On a geographic map, show the countries where toy facilities are located. Size of the bubble should be the total unit capacity (4 points). Q2. What is the total unit capacity in the United States (1 point) and in Australia (1 point). Q3. Temporarily remove United States from the map you prepared in Q1. Show the updated map (without US) (3 points). Which country has the second-largest unit capacity after United States (1 point). Q4. Many countries have more than one toy facility. Further, most facilities have more than one unit manufacturing toys. As one would expect, majority of these units do not operate at full capacity. Assuming the actual usage of the units is provided by 'Unit Actual' variable and the total unit capacity is provided by 'Unit Capacity' variable, calculate the 'Capacity Utilisation Ratio' and store values in a new variable. Show how you created this calculated item by taking a screenshot of the appropriate SAS Viya window. Generate a histogram of the new variable and copy/export it into your answer script. Interpret the histogram (4 Points) Q5. Prepare a bar chart to show the average 'Capacity Utilisation Ratio' by facility for each country. Use a filter to show only Spain, Australia, and Japan in this bar chart. Copy/export the chart into your answer script. Interpret your chart (4 points) Q6. There are many factors that could explain the variation observed in the Unit Capacity Utilization Ratio. Identify two such factors and demonstrate how these two factors explain the variation in Unit Capacity Utilization Ratio with the help of two charts and associated interpretation. (3.5 points per chart) MBAS901 Final Assignment Task 2. Predictive Data Analytics (25 points) Using 'FLCRASH' data, answer the following questions. April 5, 2023 Q1. Note the variable 'Total Crash Injuries' provide several injuries associated with every accident. In SAS Viya, prepare a histogram showing the distribution of Total Crash Injuries. What can you say about the distribution of crash injuries? (2 points) Q2. Create a new custom category variable based on 'Total Crash Injuries' variable. This new custom category variable should contain two categories only. One category is injuries equal to zero, while the other category is for crashes with one or more injuries. (3 points). Visualise the frequency of the two new categories you just created on a bar chart. How many crashes report zero injuries? (3 points) Q3. In Q2, you created a new categorical variable with only two values (binary). Your task now is to develop two models that can predict the value this target variable takes, given other explanatory variables. In other words, you attempt to predict if a crash is going to result in injuries (or not) given other important variables. What are the two models (or techniques) you can use to predict this target variable? (2 points). Create one model to predict the target variable you created in Q2. Assess this model's accuracy. What are the most important variables in predicting this target variable? (6 points). Create the second model to predict the target variable. Assess this model's accuracy. What are the most important variables identified by the model to predict the target variable. (6 points). Compare the performance of the two model. Report and discuss the results of your comparison. Which model is the champion? (3 points) Provide your answers to each question, including relevant figures (e.g. SAS Viya outputs) in a word/pdf document. You must answer all questions. Note the word limit of your answer script is 2500 words. Task 1: Exploratory Data Analysis (25 points) Using 'FACILITY_TOY' dataset available in SAS Viya, answer the following questions. Export/copy all charts you create into your answer script, describe the charts (e.g. what your chart is visualising in each axis) and interpret the charts, i.e. what your chart highlights. This interpretation is ideally accessible to even a non-technical reader. Q1. On a geographic map, show the countries where toy facilities are located. Size of the bubble should be the total unit capacity (4 points). Q2. What is the total unit capacity in the United States (1 point) and in Australia (1 point). Q3. Temporarily remove United States from the map you prepared in Q1. Show the updated map (without US) (3 points). Which country has the second-largest unit capacity after United States (1 point). Q4. Many countries have more than one toy facility. Further, most facilities have more than one unit manufacturing toys. As one would expect, majority of these units do not operate at full capacity. Assuming the actual usage of the units is provided by 'Unit Actual' variable and the total unit capacity is provided by 'Unit Capacity' variable, calculate the 'Capacity Utilisation Ratio' and store values in a new variable. Show how you created this calculated item by taking a screenshot of the appropriate SAS Viya window. Generate a histogram of the new variable and copy/export it into your answer script. Interpret the histogram (4 Points) Q5. Prepare a bar chart to show the average 'Capacity Utilisation Ratio' by facility for each country. Use a filter to show only Spain, Australia, and Japan in this bar chart. Copy/export the chart into your answer script. Interpret your chart (4 points) Q6. There are many factors that could explain the variation observed in the Unit Capacity Utilization Ratio. Identify two such factors and demonstrate how these two factors explain the variation in Unit Capacity Utilization Ratio with the help of two charts and associated interpretation. (3.5 points per chart) MBAS901 Final Assignment Task 2. Predictive Data Analytics (25 points) Using 'FLCRASH' data, answer the following questions. April 5, 2023 Q1. Note the variable 'Total Crash Injuries' provide several injuries associated with every accident. In SAS Viya, prepare a histogram showing the distribution of Total Crash Injuries. What can you say about the distribution of crash injuries? (2 points) Q2. Create a new custom category variable based on 'Total Crash Injuries' variable. This new custom category variable should contain two categories only. One category is injuries equal to zero, while the other category is for crashes with one or more injuries. (3 points). Visualise the frequency of the two new categories you just created on a bar chart. How many crashes report zero injuries? (3 points) Q3. In Q2, you created a new categorical variable with only two values (binary). Your task now is to develop two models that can predict the value this target variable takes, given other explanatory variables. In other words, you attempt to predict if a crash is going to result in injuries (or not) given other important variables. What are the two models (or techniques) you can use to predict this target variable? (2 points). Create one model to predict the target variable you created in Q2. Assess this model's accuracy. What are the most important variables in predicting this target variable? (6 points). Create the second model to predict the target variable. Assess this model's accuracy. What are the most important variables identified by the model to predict the target variable. (6 points). Compare the performance of the two model. Report and discuss the results of your comparison. Which model is the champion? (3 points)
Expert Answer:
Related Book For
Applied Regression Analysis and Other Multivariable Methods
ISBN: 978-1285051086
5th edition
Authors: David G. Kleinbaum, Lawrence L. Kupper, Azhar Nizam, Eli S. Rosenberg
Posted Date:
Students also viewed these law questions
-
Build a new custom category variable based on 'Total Crash Injuries' variable. This new custom category variable should contain only two categories. One category is injuries equal to zero, while the...
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
In Problems 2738, the reduced row echelon form of a system of linear equations is given. Write the system of equations corresponding to the given matrix. Use x, y; or x, y, z; or x 1 , x 2 , x 3 , x...
-
Here is a paper your friend turned in for a recent quiz in her mathematics class: If it is a four-point quiz, what is your friend's score? For each incorrect answer, provide the correct answer and...
-
Truball, Inc., which manufactures sports equipment, consists of several operating divisions. Division A has decided to go outside the company to buy materials since Division B informed it that the...
-
If the distance between the cathode and the target electrode is approximately \(1.0 \mathrm{~cm}\), what will be the maximum acceleration of the free electrons? Assume that the electric field is...
-
Multiple Choice Questions The following questions deal with audit evidence for the sales and collection cycle. Choose the best response. a. An auditor is performing substantive tests of transactions...
-
1. Find the Gross Debt Service Ratio for the following situations, and state whether these houses are affordable. a) The monthly mortgage payment is $805, monthly property taxes are $110, monthly...
-
A 4.0 g of an arbitrary radioisotope (physical half-life = 5 days) was ingested by a person. After 12 days, approximately 376 mg of the radioisotope is remained inside the body. Then, what would be...
-
Find out what database management systems are available at your university for student use. Investigate which data types these DBMSS support. Compare these DBMSS based on the data types supported and...
-
A 2011 report by the management consulting firm O'Rourke Group Partners indicated that a generic \$14 polo shirt sold in Canada and made in Bangladesh actually costs a retailer only \(\$ 5.67\)...
-
Based on the information in the Application "Botox Patent Monopoly," what would happen to the optimum price and quantity if the government had set a price ceiling of \(\$ 200\) per vial of Botox?...
-
Obtain access to a typical PC DBMS, such as Microsoft Access. What steps do you have to follow to link an Access database to a database on a server? Do any of these steps change depending on the DBMS...
-
Suppose that the job in Question 5.5 that pays \(w^{*}\) and has no restriction on hours is the higher-paying job. How do Jerome's budget constraint and behavior change? Data From Question 5.5:-...
-
Discuss the possibilities available to small business owners or potential small business owners to acquire sufficient resources necessary to continue to operate a business.
-
Consider the circuit of Fig. 7.97. Find v0 (t) if i(0) = 2 A and v(t) = 0. 1 3 ett)
-
Consider the quadratic programming problem. min x+x s.t. 1x2 = 4. x1
-
Prove that the dual function of Eq. (16.12) is concave Data From Equation (16.12) 19S See https://www.gams.com/ 20 See http://cvxr.com/
-
Consider the problem min x1+x2 s.t. h(x) = x2 x3 = 0, - - h2(x) = x2 = 0,
Study smarter with the SolutionInn App