Question: PLEASE ANSWER WITHIN PYSPARK I have a pyspark dataframe called Data I want to know how to replace 0 values in a column with the
PLEASE ANSWER WITHIN PYSPARK
I have a pyspark dataframe called Data
I want to know how to replace 0 values in a column with the average of said column
Say a column of numbers "Numbers" has 14 rows in the dataframe Data
| Numbers |
| 12 |
| 18 |
| 0 |
| 24 |
| 0 |
| 72 |
| 0 |
| 19 |
| 22 |
| 0 |
| 26 |
| 11 |
| 23 |
| 0 |
I want to take the average of this column and replace the '0' values with that average.
How would I do this?
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
