Question: 1. Read the data description available on Kaggle. https://www.kaggle.com/datasets/fedesoriano/heart-failure-prediction 2. Add the following code to load the data. import numpy as np import pandas as

1. Read the data description available on Kaggle.

https://www.kaggle.com/datasets/fedesoriano/heart-failure-prediction

1. Read the data description available on Kaggle. https://www.kaggle.com/datasets/fedesoriano/heart-failure-prediction 2. Add the

2. Add the following code to load the data.

import numpy as np import pandas as pd from sklearn.model_selection import train_test_split target = ["normal (negative)", "heart disease (positive)"] X = pd.read_csv('heart.csv') X_train, X_test = train_test_split(X, random_state=12, train_size=800, shuffle=True)

Use the training and test sets obtained from this step. Note that X_train and X_test are pandas dataframes.

3. Preprocess the data using the techniques learned in this course. Follow these rules when preprocessing the data.

Any preprocessing made on the training set must be also applied on the test set.

The test set values should not be read. For instance, suppose you want to replace a categorical value with its frequency. You should only use the training set to calculate the frequency of that value. Then make the replacement in the training and test sets.

After all preprocessing is complete, the target (i.e. 'HeartDisease') must be the last column.

4. Once all training and test set values are numerical or boolean, use the following code to convert the pandas dataframes to numpy arrays, and to store the features and targets in separate variables.

X_train = X_train.to_numpy().astype('float') X_test = X_test.to_numpy().astype('float') X_train, y_train = X_train[:, :-1], X_train[:, -1] X_test, y_test = X_test[:, :-1], X_test[:, -1]

5. Train a model using a scikit-learn classifier, then make predictions on the test set.

6. Show the model's performance by displaying a confusion matrix. And calculate the accuracy, precision, and recall of the model.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Complete with Python programming in areas where ### ENTER CODE HERE ### []: \# common imports import numpy as np import pandas as pd \# Do not change these options; This allows the CodeGrade auto...

Data Science, Python, Jupyter Notebook I have a term project for my Capstone class in Data Science. Below is the syllabus, dataset, and the Jupiter Notebook. I am creating a Classification model to...

dont worry about extra credit. Some code is provided so fill in what is needed Class Output The input data x1, x2, y can be loaded from file: "x1x2y_circle2.csv" Work of this file to implement the...

I'm investigating the relationship between Median value of owner-occupied homes and various features available to us in the Boston housing dataset import numpy as np import pandas as pd from sklearn...

Make use of the scikit-learn (sklearn) python package in your function implementations Complete train_test_split function Using the train_test_split function from sklearn implement a function that...

Use numpy,pandas,seaborn, matplotlib. Pls pay attentionto the asked question. import sys , import numpy as np , import pandas as pd , import matplotlib.pyplot as plt , from matplotlib import rcParams...

\ geoquad Understand the data set and program a decision tree on it ( any programming language is OK ) \ geoquad Randomly select 2 0 % , 4 0 % , 6 0 % , and 8 0 % as your training set, and the rest...

# numpy and pandas import numpy as np import pandas as pd import math #graphics with matplotlib import matplotlib.pyplot as plt plt.style.use('seaborn') %matplotlib inline # model, train/test split,...

Here is a simple definition of data science: Data science combines multiple fields including statistics, scientific methods, and data analysis to extract value from data. Those who practice data...

Need a Python code to able fill out the table below: For this assignment, you are to determine which model is best for prediction, report the right hyperparameters, and the resulting accuracy for the...

The Cypress Tool & Die Companys fiscal year ends on December 31. The company had the following items on its 20X1 income statement and balance sheet (in millions): Net sales and other operating...

Casey Morgan is a single taxpayer, social security number 412-34-5670, who lives at 582 Brockton Lane, Columbus, OH 43081. Casey has income from a job as a manager, interest and dividend income, and...

A massage therapist charges $ 1 0 0 for one hour session, but as a reward to his customers, every third and fourth session is reduced to a charge of $ 7 0 and every fifth session is free. If every...

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

=+ a. How does this change affect the incentives for working?

=+ b. How might this change represent a trade-off between equality and efficiency?

=+and you did all the cleaning, would your chores take you more or less time than if you divided