Question: In this assignment you will continue to process the dataset (diabetes.csv) using linear regression models and the scikit-learn library. Write a report on your observations.
Download the diabetesdf.csv file you created during the earlier Assignment.
Create a Pandas dataframe from diabetesdf.csv and call it assignmentdf.
Set up the Machine Learning Model:
Divide the data into a features array X and a target array y.
Split the dataset into training and testing sets at several ratios. Example: each ratio gives the percentage of training data followed by the percentage of testing data, and so on.
For each data split, apply a logistic regression machine learning model to build a confusion matrix and accuracy estimates.
Which data split gives you the best accuracy?
For the data split selected in the previous step, run the bootstrap analysis and calculate the p-value and confidence intervals.
Write a short report documenting your observations, such as accuracy, the threshold for the receiver operating characteristic (ROC) curve, the p-value for model acceptance, and a histogram showing the confidence intervals.
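
The steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the expected solution. The data is a synthetic stand-in generated with make_classification, because the real file and its target column name are not shown here. The test fractions in test_fractions are hypothetical. The p-value is one possible bootstrap definition: the share of resampled accuracies at or below a majority-class baseline. The ROC threshold is picked with Youden's J, which is also an assumed choice.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix, roc_curve
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the features X and target y. In the assignment these
# would be taken from assignmentdf (target column name not given in the text).
X, y = make_classification(n_samples=500, n_features=8, random_state=0)

# Hypothetical test fractions; the assignment's actual ratios are not shown.
test_fractions = [0.2, 0.3, 0.4]

results = {}
for frac in test_fractions:
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=frac, random_state=0, stratify=y
    )
    model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    y_pred = model.predict(X_test)
    results[frac] = {
        "accuracy": accuracy_score(y_test, y_pred),
        "confusion": confusion_matrix(y_test, y_pred),
        "model": model,
        "X_test": X_test,
        "y_test": y_test,
    }

# Pick the split with the highest test accuracy.
best_frac = max(results, key=lambda f: results[f]["accuracy"])
best = results[best_frac]

# Bootstrap: resample the test set with replacement and recompute accuracy.
rng = np.random.default_rng(0)
y_test = best["y_test"]
y_pred = best["model"].predict(best["X_test"])
n = len(y_test)
boot_acc = np.empty(1000)
for i in range(len(boot_acc)):
    idx = rng.integers(0, n, size=n)
    boot_acc[i] = accuracy_score(y_test[idx], y_pred[idx])

# 95% percentile confidence interval for the accuracy.
ci_low, ci_high = np.percentile(boot_acc, [2.5, 97.5])

# Assumed p-value definition: how often the resampled accuracy fails to beat
# the accuracy of always predicting the majority class.
baseline = max(np.mean(y_test), 1 - np.mean(y_test))
p_value = (np.sum(boot_acc <= baseline) + 1) / (len(boot_acc) + 1)

# ROC threshold chosen by Youden's J statistic (maximum of TPR - FPR).
probs = best["model"].predict_proba(best["X_test"])[:, 1]
fpr, tpr, thresholds = roc_curve(y_test, probs)
roc_threshold = thresholds[np.argmax(tpr - fpr)]

print("best test fraction:", best_frac)
print("confusion matrix:\n", best["confusion"])
print("95% CI:", ci_low, ci_high, "p-value:", p_value)
print("ROC threshold:", roc_threshold)
```

For the report's histogram, boot_acc can be passed to matplotlib.pyplot.hist, with vertical lines drawn at ci_low and ci_high to mark the confidence interval.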
