Question: Employee Turnover Analytics Project Statement: Portobello Tech is an app innovator who has devised an intelligent way of predicting employee turnover within the company. It
Employee Turnover Analytics
Project Statement:
Portobello Tech is an app innovator who has devised an intelligent way of
predicting employee turnover within the company. It periodically evaluates
employees' work details, including the number of projects they worked on
average monthly working hours, time spent in the company, promotions in the
last five years, and salary level.
Data from prior evaluations shows the employees satisfaction in the workplace.
The data could be used to identify patterns in work style and their interest in
continuing to work for the company.
The HR Department owns the data and uses it to predict employee turnover.
Employee turnover refers to the total number of workers who leave a company
over time.
As the ML Developer assigned to the HR Department, you have been asked to
create ML programs to:
Perform data quality checks by checking for missing values, if any.
Understand what factors contributed most to employee turnover at EDA
Perform clustering of employees who left based on their satisfaction and
evaluation.
Handle the left Class Imbalance using the SMOTE technique.
Perform kfold crossvalidation model training and evaluate performance.
Identify the best model and justify the evaluation metrics used.
Suggest various retention strategies for targeted employees.
Perform the following steps:
Perform data quality checks by checking for missing values, if any.
Understand what factors contributed most to employee turnover at EDA.
Draw a heatmap of the correlation matrix between all numerical
features or columns in the data.
Draw the distribution plot of:
Employee Satisfaction use column satisfactionlevel
Employee Evaluation use column lastevaluation
Employee Average Monthly Hours use column
averagemontlyhours
Draw the bar plot of the employee project count of both employees
who left and stayed in the organization use column number project
and hue column left and give your inferences from the plot.
Perform clustering of employees who left based on their satisfaction and
evaluation.
Choose columns satisfactionlevel, lastevaluation, and left.
Do Kmeans clustering of employees who left the company into
clusters?
Based on the satisfaction and evaluation factors, give your thoughts
on the employee clusters.
Handle the left Class Imbalance using the SMOTE technique.
Preprocess the data by converting categorical columns to numerical
columns by:
Separating categorical variables and numeric variables
Applying getdummies to the categorical variables
Combining categorical variables and numeric variables
Do the stratified split of the dataset to train and test in the ratio :
with randomstate
Upsample the train dataset using the SMOTE technique from the
imblearn module.
Perform fold crossvalidation model training and evaluate performance.
Train a logistic regression model, apply a fold CV and plot the
classification report.
Train a Random Forest Classifier model, apply the fold CV and plot
the classification report.
Train a Gradient Boosting Classifier model, apply the fold CV and
plot the classification report.
Identify the best model and justify the evaluation metrics used.
Find the ROCAUC for each model and plot the ROC curve.
Find the confusion matrix for each of the models.
Explain which metric needs to be used from the confusion matrix:
Recall or Precision?
Suggest various retention strategies for targeted employees.
Using the best model, predict the probability of employee turnover
in the test data.
Based on the probability score range below, categorize the
employees into four zones and suggest your thoughts on the
retention strategies for each zone.
Safe Zone GreenScore
LowRisk Zone Yellow Score
MediumRisk Zone Orange Score
HighRisk Zone RedScore
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
