We have a DataFrame df0 with 10000 records (samples). Assume that we run the following code:

    df0 = df0.sample(0.20, seed=10000)
    train, test = df0.randomSplit(weights=[0.8, 0.2], seed=2000)

What will be the approximate size of the train DataFrame?

- 1600
- 2000
- 10000
- Cannot be determined
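One way to reason about the question is to follow the expected row counts: sample(0.20) keeps roughly 20% of the 10000 rows (about 2000), and randomSplit with weights [0.8, 0.2] then sends roughly 80% of those to train (about 1600). The sketch below is a plain-Python simulation of that arithmetic, not Spark itself. It assumes each row is kept or assigned independently at random, and the helper names bernoulli_sample and random_split are hypothetical stand-ins for illustration. Python's random module is not Spark's generator, so the exact counts will differ from a real Spark run with the same seeds.

```python
import random

def bernoulli_sample(rows, fraction, seed):
    """Keep each row independently with probability `fraction` (hypothetical helper)."""
    rng = random.Random(seed)
    return [r for r in rows if rng.random() < fraction]

def random_split(rows, weights, seed):
    """Assign each row to one of two parts according to normalized weights (hypothetical helper)."""
    rng = random.Random(seed)
    cut = weights[0] / sum(weights)  # probability of landing in the first part
    first, second = [], []
    for r in rows:
        (first if rng.random() < cut else second).append(r)
    return first, second

rows = list(range(10000))                                # stand-in for the 10000 records
sampled = bernoulli_sample(rows, 0.20, seed=10000)       # about 2000 rows
train, test = random_split(sampled, [0.8, 0.2], seed=2000)

expected_train = 10000 * 0.20 * 0.8                      # expected size, about 1600
print(len(sampled), len(train), len(test), expected_train)
```

The counts are approximate because both steps are random: neither the fraction nor the weights guarantee exact sizes, only expected ones. Under these assumptions the closest option is 1600.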
