Question: Problem 2. Rob has successfully installed Spark on Ubuntu and has run the WordCount example successfully. He now wants to understand each line of the WordCount.py file. He opens the pyspark shell. Suppose that the Spark context sc is already defined when he opens the shell.
He wants to load the file /home/rob/data/peterpan.txt. Please tell him how to load it in pyspark (provide the command).
Answer: $
What is the type of the returned value?
Answer:
What is the command to split each line into words on whitespace?
Answer: $
What is the type of the returned value?
Answer:
What type of function did you call: a transformation or an action?
Answer:
Please write the command to emit the (key = word, value = 1) pairs.
Answer: $
Please write the command to reduce the (key = word, value = 1) pairs.
Answer: $
The command to display the contents of the returned RDD in the terminal is
Answer: $
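The steps asked about above can be sketched as follows. This is a minimal illustration, not a reference solution: each comment shows one pyspark shell command that would fit the corresponding question (assuming sc is defined, as stated in the problem), and the line below it is a plain-Python stand-in that computes the same result on a small in-memory sample, so the data flow can be inspected without Spark. The sample text and the variable names lines, words, pairs, and counts are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import chain

# pyspark shell: lines = sc.textFile("/home/rob/data/peterpan.txt")
#   -> returns an RDD of strings, one element per line of the file.
# Stand-in: two made-up lines instead of the real file contents.
lines = ["the boy flew away", "the boy laughed"]

# pyspark shell: words = lines.flatMap(lambda line: line.split())
#   -> flatMap is a transformation (lazy); it returns a new RDD of words.
#   str.split() with no argument splits on any run of whitespace.
words = list(chain.from_iterable(line.split() for line in lines))

# pyspark shell: pairs = words.map(lambda word: (word, 1))
#   -> map is a transformation; it returns an RDD of (word, 1) tuples.
pairs = [(word, 1) for word in words]

# pyspark shell: counts = pairs.reduceByKey(lambda a, b: a + b)
#   -> reduceByKey is a transformation; it sums the values for each key.
totals = defaultdict(int)
for word, one in pairs:
    totals[word] += one
# Sorted only to make the printed order stable; Spark does not sort here.
counts = sorted(totals.items())

# pyspark shell: counts.collect()
#   -> collect is an action; it runs the job and returns a Python list.
print(counts)
```

In the real shell nothing is computed until the action collect() is called, because the earlier calls only describe transformations on the RDD. For a large file, an action such as take(10) returns just a few elements instead of the whole result.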
