Question: Question: Exercise 3. Using DataFrames, nd the full name of every country in Oceania (continent OC). Show the rst 10 country names in ascending alphabetical



Question:
Exercise 3. Using DataFrames, nd the full name of every country in Oceania (continent OC).
Show the rst 10 country names in ascending alphabetical order.
1) Which of the following set of dataframe functions best answers exercise 3 of lab 6?
a. filter, orderBy, show(10)
b. filter, select, orderBy, show(10)
c. select, orderBy, show(10)
d. filter, select, join, orderBy, show(10)
Exercise 4. Using the fridgeDF DataFrame as input, calculate the average refrigerator eciency
for each brand. Order the results in descending order of average eciency and show the rst 5 rows.
Hint: RelationalGroupedDataset has a method called avg() for calculating per-group averages. It
works similarly to count(), except you must pass avg() the names of the columns to be averaged
2) Which of the following set of dataframe functions best answers exercise 4 of lab 6?
a. groupBy, avg, orderBy, show(5)
b. avg, orderBy, show(5)
c. groupBy, orderBy, show(5)
d. filter, groupBy, orderBy, show(5)
3) Which of the following statements about parquet storage format is false?
a. Parquet storage format stores the schema with the data.
b. Given a dataframe with 100 columns. It is faster to query a single column of the dataframe if the data is stored using the CSV storage format compared to parquet storage format.
c. Given a dataframe with 100 columns. It is faster to query a single column of the dataframe if the data is stored using the parquet storage format compared to it being stored in a CSV storage format.
d. Parquet storage format stores all values of the same column together.
4) Which of the following statements is false?
a. DataSets contain schemas whereas DataFrames do not contain schemas.
b. You can add columns to a dataframe using the withColumn function.
c. After performing a self-join on a dataframe the resulting columns will contain duplicate column names.
d. Executing queries using SparkSQL Dataframes and DataSets functions are at least as fast as using their RDD counterparts, often faster.




11 - Five Number Summary and Boxplots LEARNING OBJECTIVE: Identify the quartiles in a box-and-whisker plot. .OOD A box-and-whisker plot has been constructed for the following data: 6, 8, 11, 12, 15, 18, 23, 29, 35, and is presented below. 15 23 35 10 20 30 According to this box-and-whisker plot, the third quartile (Q3) value is O .) 15 O b.) 11 O .) 35 ( d.) 23What is the five number summary of the following box and whisker plot? 50 60 70 90 100 Create a box and whisker plot with the following set of data: 3, 2, 3, 4, 6, 6, 7 Create a box and whisker plot with the following set of data: 1, 2, 5, 6, 9, 12, 7, 10 Find the mean, median, mode, and range of the following set of data: 7, 6, 2, 7, 8, 3, 12, 9, 7, 4, 6, 7, 11ent/406551/viewContent/10433157/View 2.3 Quartiles and Box Plots Construct and Understand Box-and-Whisker Plots Question Given the following box-and-whisker plot, what is the third quartile of the data?
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
