Question: Exercise 2 : Before diving deeper into the data, we should stop and create a training and a test set. Since we are trying to
Exercise : Before diving deeper into the data, we should stop and create a training and a test set.
Since we are trying to predict whether an individual earns over $K save the income column as a Series named incomelabel.
Drop the income column from the censusincome DataFrame and save the remaining columns as a DataFrame named incomefeatures.
Utilize Scikitlearn's traintestsplit function, employing the incomefeatures and incomelabel variables, to partition the data into a training set and a test set. Allocate of the instances for training and for testing. Set the randomstate to to ensure reproducibility of our results. Assign the DataFrames the following names: Xtrain, Xtest, ytrain, and ytest.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
