Question: Using dplyr and ggplot2 for EDA with with R Foundation for Statistical Computing For this assignment you will be using the functions from the dplyr
Using dplyr and ggplot2 for EDA with with R Foundation for Statistical Computing
For this assignment you will be using the functions from the dplyr package for data transformation and ggplot2 for data visualization.
You will be analyzing the data set flights from the nycflights13 package
1. Load the tidyverse package (this loads automatically dplyr and ggplot2)
library(tidyverse)
2. Load the nycflights13 package
library(nycflights13)
6. Make a boxplot that shows the distribution of covered distance, using the variable distance, per carrier.
What do you notice?
Find any flights by United Airlines that covered more than 3500 miles. What were the Origin and Destination of such flights?
7. Create a histogram, with the option binwidth = 10, of the variable dep_delay departure delay. Set the limits in the horizontal axis to be between 0 and 30 (use the layer coord_cartesian for this)
8. Create a bar plot to show the number of flights per carrier
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
