Question: from Chapter 2 of Spark Guide 0. Construct your own schema for the flight data and use it to read in the flight data 1.
from Chapter 2 of Spark Guide
0. Construct your own schema for the flight data and use it to read in the flight data
1. Read in the Flight data ( summary-2015.csv) in DataBricks ( or IntelliJ)
1.5 Rename the DEST_... column and the ORIG_ ... column to names you like
2. Sort the DataFrame on the count column and output the first 10 rows
3. Construct a case class and convert the DataFrame to a Dataset
4.Convert your Dataset to a SQL table and use SQL to select the columns and print out the top 10,
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
