Question: Employee_attrition.csv is a dataset for an employee attrition binary prediction problem. Split the data set employee_attrition.csv into a training set, a validation set, and a test set with a ratio of 8:1:1. Train a decision tree model using scikit-learn, tune its parameters using the validation set, and then report its performance on the test set with the optimal parameters. This problem is from a competition on the Kaggle.com website:
https://www.kaggle.com/competitions/playground-series-s3e3/leaderboard
Currently, the best performance on their leaderboard is 95.596%. Compare your result to the leaderboard scores and discuss potential improvements. You can also investigate the state-of-the-art algorithms on the competition's discussion board and report your findings.
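One possible way to approach the split, tuning, and test-set evaluation is sketched below. It is a minimal sketch, not a reference solution, and it rests on several assumptions that are not stated in the question: that the target column is named "Attrition", that there is an identifier column named "id", that categorical columns can be one-hot encoded, that max_depth is the parameter being tuned, and that ROC AUC is the score to compare against the leaderboard. The helper names prepare_features, split_8_1_1, and tune_and_evaluate are hypothetical.

```python
import pandas as pd
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier


def prepare_features(df, target="Attrition", id_col="id"):
    """Separate the target and one-hot encode categorical columns.

    The column names "Attrition" and "id" are assumptions about the CSV.
    """
    y = df[target]
    X = df.drop(columns=[target, id_col], errors="ignore")
    X = pd.get_dummies(X)  # numeric columns pass through unchanged
    return X, y


def split_8_1_1(X, y, seed=0):
    """Stratified 80/10/10 split into train, validation, and test sets."""
    # First hold out 20% of the rows, then halve that part into val and test.
    X_train, X_rest, y_train, y_rest = train_test_split(
        X, y, test_size=0.2, stratify=y, random_state=seed)
    X_val, X_test, y_val, y_test = train_test_split(
        X_rest, y_rest, test_size=0.5, stratify=y_rest, random_state=seed)
    return X_train, X_val, X_test, y_train, y_val, y_test


def tune_and_evaluate(X, y, depths=(2, 3, 4, 5, 6, 8, 10), seed=0):
    """Pick max_depth by validation ROC AUC, then score once on the test set."""
    X_train, X_val, X_test, y_train, y_val, y_test = split_8_1_1(X, y, seed)
    best_depth, best_auc, best_model = None, -1.0, None
    for depth in depths:
        model = DecisionTreeClassifier(max_depth=depth, random_state=seed)
        model.fit(X_train, y_train)
        auc = roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])
        if auc > best_auc:
            best_depth, best_auc, best_model = depth, auc, model
    # The test set is used only here, after the parameter has been chosen.
    test_auc = roc_auc_score(y_test, best_model.predict_proba(X_test)[:, 1])
    return {"best_max_depth": best_depth, "val_auc": best_auc,
            "test_auc": test_auc}
```

With the data file in the working directory, the sketch could be run as X, y = prepare_features(pd.read_csv("employee_attrition.csv")) followed by print(tune_and_evaluate(X, y)). The resulting test score is only comparable to the 95.596% leaderboard figure if both use the same metric, and the same loop can be extended to other parameters such as min_samples_leaf.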
