Question: PROGRAM IN R CONSOLE please Download file here: https://files.fm/u/96w8jg8j # Question 1: # Read in the SalesData2017.csv data into a dataframe using the read.csv() function
PROGRAM IN R CONSOLE please
Download file here: https://files.fm/u/96w8jg8j
# Question 1:
# Read in the SalesData2017.csv data into a dataframe using the read.csv() function
# Question 2:
# Convert the SalesDate column to a date column using the as.Date() function
# Question 3:
# Explore the data and find different summary statistics such as:
# a.) How many rows of data are there?
# b.) What is the average SalesDollars?
# c.) What is the earliest and latest date?
# d.) What is the median SalesQuantity?
# e.) What is the most popular size to buy? How many of that size have been sold?
# Question 4:
# Assign dataframe called 'blacklabel' to just have the description 'Johnnie Walker Black Label'.
# Check the environment to make sure you filtered properly.
# Question 5:
# Generate a scatter plot of the newly created dataframe with SalesDate in the x axis and SalesDollars in the y axis.
# What patterns do you notice?
# What do you think are the reasons for this pattern? Explore the data more to find reasons for these trends.
# Question 6:
# Put a new column in the blacklabel dataframe extracting the months from the SalesDate column using the months function
# Question 7:
# Find the sum of the salesdollars each month and store it in a new dataframe called blacklabel_monthsum
# Question 8:
# Generate a bar chart of the total sales dollars and month using the dataframe you just created. What are the outlier months?
# Question 9:
# Filter the black label data to just March
# Question 10:
# Examine the dataset on your own. Write down any observations you may have about what may contribute to the outlier sales.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
