Question: The data had to be cleaned to ascertain the accuracy. The year of release was oftentimes reported as a range (e.g. 2008-2013), thus everyone extracted the first year of the range to be able to do trend analysis. The rating and vote fields had inconsistencies and these were changed to numeric values. Field pairs that were not important were dropped and visual depictions were done, such as a histogram of the ratings (Figure 1) and category versus episodes for IMDB top 250 shows (Figure 2). The column on the release year was also reformatted (e.g., 20082013), missing values discarded and the numeric fields were standardized. Pivot tables produced the information of content types against ratings, and bar and stacked bar charts visualized genre, audience rating, and age categories.
Make sure the text is written in Australian English without changing the content.
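For readers who want to see what the cleaning steps described in the passage might look like in practice, here is a minimal sketch in Python using only the standard library. It is an illustration under stated assumptions, not the method the original authors used: the field names (year, rating, votes), the helper names and the sample rows are all hypothetical.

```python
import re
from statistics import mean, pstdev


def first_year(value):
    """Return the first four-digit year in a value such as '2008-2013', or None."""
    match = re.search(r"\d{4}", str(value))
    return int(match.group()) if match else None


def to_number(value):
    """Coerce strings such as '8.7' or '1,234' to float; return None if not numeric."""
    try:
        return float(str(value).replace(",", "").strip())
    except ValueError:
        return None


def clean(rows):
    """Parse every row, then discard rows that have any missing field."""
    parsed = [
        {
            "year": first_year(row.get("year")),
            "rating": to_number(row.get("rating")),
            "votes": to_number(row.get("votes")),
        }
        for row in rows
    ]
    return [row for row in parsed if None not in row.values()]


def standardise(values):
    """Rescale values to zero mean and unit (population) standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma if sigma else 0.0 for v in values]


# Hypothetical sample rows, invented for illustration only.
rows = [
    {"year": "2008-2013", "rating": "9.5", "votes": "1,800,000"},
    {"year": "2011", "rating": "n/a", "votes": "2,000"},
    {"year": "2016-", "rating": "8.7", "votes": "1,100,000"},
]
cleaned = clean(rows)
scaled_ratings = standardise([row["rating"] for row in cleaned])
```

The second sample row is dropped because its rating cannot be read as a number, which mirrors the "missing values discarded" step. Taking the first four digits also handles a run-together value such as 20082013.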
