Question: QUESTION 1 5 0 Marks i . Explain in details, the stages of the Machine Learning life cycle. [ 5 ] ii . Explain in
QUESTION Marks
i Explain in details, the stages of the Machine Learning life cycle.
ii Explain in detail the difference between overfitting and underfitting in Machine
Learning and ways to overcome them.
iii. What is the difference between regression and classification
iv Write a pseudo algorithm for the Kmeans clustering.
v Using examples and mathematical equations indicate the difference between Entropy
and Gini Impurity in a Decision Tree?
You are working on a binary classification problem to predict whether patients have Covid
disease Positive or not Negative using a machine learning model. After training your
model, you obtain the following confusion matrix on the test data:
Predicted Positive Predicted Negative
Actual Positive
Actual Negative
vi Calculate the following performance metrics: accuracy, precision, recall, Fscore,
and specificity and interpret these metrics in the context of the Covid disease
prediction problem.
vii. Discuss the implications of changing the decision threshold of your model. How
would increasing or decreasing the threshold affect the confusion matrix and the
derived metrics.
viii. Which type of error false positive or false negative do you think is more critical to
minimize, and why? Propose a strategy to mitigate this type of error.
ix If the prevalence of the disease is very low minority and you suspect the dataset is
imbalanced. Describe five steps you can utilize to handle this imbalance.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
