Question: When discussing the evaluation of model performance, we introduced the method of dividing the data into a training set and a test set. In the illustration of the Caret package, we further demonstrated that the summary statistics of the training and test sets are similar. The question for discussion is: in machine learning, why should we divide the data into two sets with similar characteristics, and how do we do it?
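As one possible illustration of the "how" part of the question, the sketch below performs a stratified random split: it shuffles the observations within each outcome class and sends the same fraction of every class to the training set, so the two sets end up with similar class proportions. This is a minimal plain-Python sketch, not the Caret code from the original illustration; the helper names `stratified_split` and `class_share`, the 80/20 ratio, the seed, and the toy labels are all assumptions made for the example.

```python
import random
from collections import defaultdict


def stratified_split(labels, train_fraction=0.8, seed=42):
    """Return (train_idx, test_idx), sampling each class at the same rate.

    Hypothetical helper for illustration; not part of any library.
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible

    # Group row indices by their class label.
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)

    train_idx, test_idx = [], []
    for idx in by_class.values():
        rng.shuffle(idx)  # random assignment within the class
        cut = round(train_fraction * len(idx))
        train_idx.extend(idx[:cut])
        test_idx.extend(idx[cut:])
    return sorted(train_idx), sorted(test_idx)


def class_share(labels, indices, target):
    """Fraction of the rows in `indices` whose label equals `target`."""
    return sum(labels[i] == target for i in indices) / len(indices)


# Toy data (assumed): 30 "yes" and 70 "no" outcomes.
labels = ["yes"] * 30 + ["no"] * 70
train_idx, test_idx = stratified_split(labels)

print("train size:", len(train_idx), "test size:", len(test_idx))
print("share of 'yes' in train:", class_share(labels, train_idx, "yes"))
print("share of 'yes' in test: ", class_share(labels, test_idx, "yes"))
```

Comparing the class shares (or, for numeric variables, the summary statistics) of the two sets is a quick check that the test set resembles the data the model was trained on, which is what makes the test error a fair estimate of performance on new data.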
