Question: We have a DataFrame df0 with 10000 records (samples). Assume that we run the following code:

df0 = df0.sample(0.20, seed=10000)
train, test = df0.randomSplit(weights=[0.8, 0.2], seed=2000)

What will be the approximate size of the train DataFrame?

1600

2000

10000

Cannot be determined
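The arithmetic behind the question can be sketched as follows: sample(0.20) keeps roughly 20% of the 10000 rows (about 2000), and randomSplit with weights [0.8, 0.2] sends roughly 80% of those to train, giving about 1600. Both operations are probabilistic, so the real counts are only approximate. The snippet below is a plain-Python stand-in, not Spark itself: the helper names bernoulli_sample and random_split are hypothetical, and it assumes each row is kept or assigned independently at random, which is only a simplified model of what Spark does.

```python
import random

def bernoulli_sample(rows, fraction, seed):
    # Hypothetical helper: keep each row independently with probability `fraction`.
    rng = random.Random(seed)
    return [r for r in rows if rng.random() < fraction]

def random_split(rows, weights, seed):
    # Hypothetical helper: assign each row to one of two parts,
    # with probability proportional to the normalized weights.
    rng = random.Random(seed)
    cutoff = weights[0] / sum(weights)
    first, second = [], []
    for r in rows:
        (first if rng.random() < cutoff else second).append(r)
    return first, second

rows = list(range(10000))                        # stands in for the 10000 records
sampled = bernoulli_sample(rows, 0.20, seed=10000)
train, test = random_split(sampled, [0.8, 0.2], seed=2000)

# Expected train size: 10000 * 0.20 * 0.8
expected_train = round(10000 * 0.20 * 0.8)
print(expected_train)                            # 1600
print(len(sampled), len(train), len(test))       # close to 2000, 1600, 400, not exact
```

In PySpark itself, calling train.count() after the two lines in the question would report the actual size, which should land near 1600 rather than exactly on it.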

