Question: The file employee_attrition.csv is a dataset for a binary employee attrition prediction problem. Split the dataset into a training set, a validation set, and a test set with a ratio of 8:1:1. Train a decision tree model using scikit-learn, tune its parameters using the validation set, and then report its performance on the test set with the optimal parameters. This problem is from a competition on Kaggle.com:

https://www.kaggle.com/competitions/playground-series-s3e3/leaderboard

Currently, the best performance on the leaderboard is 95.596%. Compare your result to the leaderboard and discuss potential improvements. You can also investigate the state-of-the-art algorithms on the competition's discussion board and report your findings.
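The workflow the question describes (an 8:1:1 split, tuning on the validation set, a single final evaluation on the test set) could be sketched roughly as follows. This is a minimal illustration, not a reference solution. The column names `Attrition` and `id`, the parameter grid, and the use of ROC AUC as the score to compare with the leaderboard are all assumptions; check them against the actual CSV header and the competition's evaluation page.

```python
import pandas as pd
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier


def split_8_1_1(X, y, seed=0):
    """Stratified 80/10/10 split into train, validation, and test sets."""
    # Hold out 20% first, then cut that holdout in half.
    X_train, X_hold, y_train, y_hold = train_test_split(
        X, y, test_size=0.2, stratify=y, random_state=seed)
    X_val, X_test, y_val, y_test = train_test_split(
        X_hold, y_hold, test_size=0.5, stratify=y_hold, random_state=seed)
    return X_train, X_val, X_test, y_train, y_val, y_test


def tune_and_evaluate(df, target="Attrition", id_col="id", seed=0):
    """Tune a decision tree on the validation set, then score it on the test set."""
    # Assumed column names; the target is assumed to be binary.
    y = df[target]
    X = df.drop(columns=[target, id_col], errors="ignore")
    X = pd.get_dummies(X)  # one-hot encode any categorical columns

    X_train, X_val, X_test, y_train, y_val, y_test = split_8_1_1(X, y, seed)

    best = None
    # Illustrative grid; ties keep the earlier candidate.
    for depth in (2, 3, 4, 5, 6, 8, 10, None):
        for leaf in (1, 5, 10, 20, 50):
            model = DecisionTreeClassifier(
                max_depth=depth, min_samples_leaf=leaf, random_state=seed)
            model.fit(X_train, y_train)
            val_auc = roc_auc_score(y_val, model.predict_proba(X_val)[:, 1])
            if best is None or val_auc > best[0]:
                best = (val_auc, depth, leaf, model)

    val_auc, depth, leaf, model = best
    # The test set is touched exactly once, with the selected parameters.
    return {
        "max_depth": depth,
        "min_samples_leaf": leaf,
        "val_auc": val_auc,
        "test_auc": roc_auc_score(y_test, model.predict_proba(X_test)[:, 1]),
        "test_accuracy": accuracy_score(y_test, model.predict(X_test)),
        "sizes": (len(X_train), len(X_val), len(X_test)),
    }
```

Calling `tune_and_evaluate(pd.read_csv("employee_attrition.csv"))` would return the selected parameters along with the validation and test scores. A single decision tree is a high-variance model, so a gap to the top of the leaderboard is to be expected; ensembles of trees such as random forests or gradient boosting are the usual next step to try.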
