Question: Find Unique Elements in a RDD Spark. Hello, I have a Data set: Find unique elements in the Column'Animal' Animal Count Dog . 1 Cat
Find Unique Elements in a RDD Spark.
Hello,
I have a Data set:
Find unique elements in the Column'Animal'
Animal Count
Dog . 1
Cat . 3
Dog . 4
Cow . 5
I want the result to be a new RDD
Animal
Dog
Cat
Cow
Please implement this in Pyspark (Apache Spark using RDD and not DataFrame)
Also please let me know how to select a particular column from an RDD.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
