Subset the data set based on the location, day of the week, type of collision, and lighting

Question:

Subset the data set based on the location, day of the week, type of collision, and lighting condition. Compare these subsets of data to find interesting patterns. Can you identify any links between crash fatality and the aforementioned variables? Are there any missing values? Which strategy should you use to handle the missing values? Because many of the variables are categorical, you should consider transforming them into dummy variables prior to the analysis.

IDCountyCityWeekdaySeverityViolCatClearWeatherMonthCrashTypeHighway
1SAN DIEGOSAN DIEGO71801A0
2HUMBOLDTUNINCORPORATED41811A1
3VENTURAOXNARD211212A0
4STANISLAUSUNINCORPORATED41111A0
5MENDOCINOUNINCORPORATED51111A1
6LOS ANGELESLONG BEACH71313A0
7LOS ANGELESLOS ANGELES41303A0
8CALAVERASUNINCORPORATED11112A1
9SAN BERNARDINOHESPERIA21111A0
10VENTURAOXNARD50811A0
11VENTURAOXNARD60801A0
12ORANGEFULLERTON40912A0
13SAN DIEGOCHULA VISTA10311A0
14ALAMEDAOAKLAND60111A0
15LOS ANGELESLOS ANGELES50913A0
16SANTA CLARAMORGAN HILL40813A0
17LOS ANGELESLOS ANGELES30913A0
18SAN JOAQUINUNINCORPORATED30813A0
19LOS ANGELESLOS ANGELES50914A0
Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Related Book For  answer-question

Business Analytics

ISBN: 9781265897109

2nd Edition

Authors: Sanjiv Jaggia, Alison Kelly, Kevin Lertwachara, Leida Chen

Question Posted: