Question: I need both parts a and b.

The data science team in a genetic testing company has developed a predictive model to identify Type 1 Gaucher disease. From domain knowledge, the prevalence of Type 1 Gaucher disease in the US population is 4%. The model was built on a dataset of 4000 samples, of which 1800 samples were diagnosed as positive. The team partitioned the dataset into 70% training and 30% validation with a stratified sampling technique. The sensitivity and specificity achieved on the validation set are 70% and 90%, respectively.

a. Calculate the adjusted misclassification rate, precision, and recall on the validation set. Comment on the model performance.

b. Recommend another scheme to deal with the unbalanced data for this data science team.
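For part a, here is a minimal sketch of the arithmetic. It assumes that "adjusted" means re-weighting the validation-set sensitivity and specificity from the sample prevalence (1800/4000 = 45%) to the 4% population prevalence. The function name `adjusted_metrics` and its parameters are made up for illustration; they do not come from the question.

```python
def adjusted_metrics(sensitivity, specificity, prevalence):
    """Re-weight class-conditional rates to a given prevalence.

    Returns the misclassification rate, precision, and recall expected
    when the positive class makes up `prevalence` of the population.
    """
    # Joint proportions of each confusion-matrix cell at this prevalence.
    tp = prevalence * sensitivity
    fn = prevalence * (1 - sensitivity)
    fp = (1 - prevalence) * (1 - specificity)
    tn = (1 - prevalence) * specificity  # not used below, kept for completeness

    return {
        "misclassification_rate": fn + fp,
        "precision": tp / (tp + fp),
        "recall": tp / (tp + fn),
    }


# Figures taken from the question.
sample_prevalence = 1800 / 4000   # 45% positives in the stratified dataset
population_prevalence = 0.04      # 4% prevalence from domain knowledge

as_sampled = adjusted_metrics(0.70, 0.90, sample_prevalence)
adjusted = adjusted_metrics(0.70, 0.90, population_prevalence)

for label, metrics in (("as sampled", as_sampled), ("adjusted", adjusted)):
    print(label, {name: round(value, 4) for name, value in metrics.items()})
```

Under that assumption, the adjusted misclassification rate is about 10.8%, the adjusted precision about 22.6%, and recall stays at 70%, because sensitivity is conditional on the true class. At the 45% sample prevalence the same model shows about 19% misclassification and about 85% precision, so the oversampled validation set makes precision look much better than it would be in the population.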
