Question: Questions ( 1 2 marks ) Explain the main differences between RDDs , Dataframes and Datasets ( 4 marks ) Answer the following questions: 2

Questions (12 marks)
Explain the main differences between RDDs, Dataframes and Datasets (4 marks)
Answer the following questions:
2.1 How many sensor pads are reported to be from Poland (2 marks)
2.2 How many different LCDs (distinct colors) are present in the dataset (2 marks)
2.3 Find 5 countries that have the largest number of MAC devices used (2 marks)
2.4 Propose and try an interesting statistical test or machine learning model you could use to gain insight from this dataset. Note, you don't have to use Machine Learning for this question. You can apply any analysis to the data even using SparkSQL, Python visualization libraries to analyze the data. Another example cloud be to apply correlation functions or other Spark functions to analyze the data. (2 marks)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!