Question: Using dplyr and ggplot2 for EDA with with R Foundation for Statistical Computing For this assignment you will be using the functions from the dplyr

Using dplyr and ggplot2 for EDA with with R Foundation for Statistical Computing

For this assignment you will be using the functions from the dplyr package for data transformation and ggplot2 for data visualization.

You will be analyzing the data set flights from the nycflights13 package

1. Load the tidyverse package (this loads automatically dplyr and ggplot2)

library(tidyverse)

2. Load the nycflights13 package

library(nycflights13)

6. Make a boxplot that shows the distribution of covered distance, using the variable distance, per carrier.

What do you notice?

Find any flights by United Airlines that covered more than 3500 miles. What were the Origin and Destination of such flights?

7. Create a histogram, with the option binwidth = 10, of the variable dep_delay departure delay. Set the limits in the horizontal axis to be between 0 and 30 (use the layer coord_cartesian for this)

8. Create a bar plot to show the number of flights per carrier

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!