Question:

We are using the Iris dataset with errors for our analysis:

Import the iris_errors file into a dataframe named "df" (sample done, below)

import numpy as np
import pandas as pd

# Reading the CSV file
df = pd.read_csv("iris_errors2.csv")

# Printing top 5 rows
df.head()
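Before working with the loaded data, it can help to check which columns pandas read as text rather than numbers, and which entries caused that. The sketch below is only an illustration: the contents of iris_errors2.csv are not shown here, so `SAMPLE_CSV`, its column names (other than PetalWidthCm), the `?` placeholder value, and the helper `non_numeric_entries` are all assumptions.

```python
import io

import pandas as pd

# Hypothetical stand-in for iris_errors2.csv (the real file is not shown).
SAMPLE_CSV = """SepalLengthCm,SepalWidthCm,PetalLengthCm,PetalWidthCm,Species
5.1,3.5,1.4,0.2,Iris-setosa
4.9,3.0,1.4,?,Iris-setosa
7.0,3.2,4.7,1.4,Iris-versicolor
"""


def non_numeric_entries(df, col):
    """Return the rows whose value in `col` cannot be parsed as a number."""
    parsed = pd.to_numeric(df[col], errors="coerce")
    # Keep rows that failed to parse but were not missing to begin with.
    return df[parsed.isna() & df[col].notna()]


df = pd.read_csv(io.StringIO(SAMPLE_CSV))

# A column containing any non-numeric text is read as a text column.
print(df.dtypes)

# Show the offending rows in the suspect column.
print(non_numeric_entries(df, "PetalWidthCm"))
```

With the real file, the same helper could be called on the `df` returned by `pd.read_csv("iris_errors2.csv")`.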

Answer the following questions in Python, implemented in a Jupyter Notebook. Attach the output along with the code:

1. Change the data type of the PetalWidthCm column to numeric.

2. How many rows and columns does our new dataframe have?

3. How balanced is our data now?

4. Import the Seaborn and matplotlib libraries and draw pairplots for all the data columns (colored by species).

5. Do there appear to be any outliers in the data?

6. Draw stripplots for each species for sepal length and sepal width

Draw violinplots for each species for petal length and petal width.

7. Eliminate the obvious outlier by replacing it with the average for that species in that column.

8. Draw pairplots for all data columns again (still colored by species).

9. Draw boxplots for all data columns for each species.

10. Display Pearson Correlation and heatmap for all data columns.

11. Compare the data in this homework with the data in the original, 'clean' iris.csv dataset. Do you see any significant differences?
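One possible way to approach these questions is sketched below. It is not a verified solution for the actual file. The column names other than PetalWidthCm follow the common Kaggle-style Iris CSV and are assumptions, as are the helper names, the toy rows, and the 3×IQR rule used to flag the "obvious" outlier (the assignment appears to expect spotting it by eye from the plots). The replacement value is the species mean computed without the flagged value, which is one reading of question 7.

```python
import pandas as pd

# Assumed column names; only PetalWidthCm is named in the question.
NUMERIC_COLS = ["SepalLengthCm", "SepalWidthCm", "PetalLengthCm", "PetalWidthCm"]
LABEL = "Species"


def to_numeric_column(df, col="PetalWidthCm"):
    """Question 1: coerce a column to numbers; unparseable entries become NaN."""
    out = df.copy()
    out[col] = pd.to_numeric(out[col], errors="coerce")
    return out


def class_balance(df):
    """Question 3: number of rows per species."""
    return df[LABEL].value_counts()


def replace_outliers(df, col, k=3.0):
    """Question 7: replace values outside k*IQR fences (computed per species)
    with that species' mean of the remaining values."""
    out = df.copy()
    grouped = out.groupby(LABEL)[col]
    q1 = grouped.transform(lambda s: s.quantile(0.25))
    q3 = grouped.transform(lambda s: s.quantile(0.75))
    iqr = q3 - q1
    is_outlier = (out[col] < q1 - k * iqr) | (out[col] > q3 + k * iqr)
    # Mean per species with the flagged values masked out.
    clean_mean = out[col].mask(is_outlier).groupby(out[LABEL]).transform("mean")
    out.loc[is_outlier, col] = clean_mean[is_outlier]
    return out


def pearson(df):
    """Question 10: Pearson correlation matrix of the numeric columns."""
    return df[NUMERIC_COLS].corr(method="pearson")


def draw_plots(df):
    """Questions 4, 6, 8, 9 and 10: pairplot, per-species plots, heatmap."""
    # Imported here so the data helpers above work without plotting libraries.
    import matplotlib.pyplot as plt
    import seaborn as sns

    sns.pairplot(df, vars=NUMERIC_COLS, hue=LABEL)
    for col in NUMERIC_COLS:
        # Stripplots for sepal columns, violinplots for petal columns,
        # plus a boxplot for every column; each goes in its own figure.
        first = sns.stripplot if col.startswith("Sepal") else sns.violinplot
        for plot in (first, sns.boxplot):
            fig, ax = plt.subplots()
            plot(data=df, x=LABEL, y=col, ax=ax)
            ax.set_title(col)
    fig, ax = plt.subplots()
    sns.heatmap(pearson(df), annot=True, cmap="coolwarm", ax=ax)
    plt.show()


# Made-up rows for illustration only; not taken from iris_errors2.csv.
toy = pd.DataFrame({
    "SepalLengthCm": [5.1, 4.9, 4.7, 5.0, 50.0, 7.0, 6.4, 6.9, 5.5, 6.5],
    "SepalWidthCm": [3.5, 3.0, 3.2, 3.6, 3.4, 3.2, 3.2, 3.1, 2.3, 2.8],
    "PetalLengthCm": [1.4, 1.4, 1.3, 1.4, 1.5, 4.7, 4.5, 4.9, 4.0, 4.6],
    "PetalWidthCm": ["0.2", "0.2", "0.2", "0.2", "?",
                     "1.4", "1.5", "1.5", "1.3", "1.5"],
    "Species": ["Iris-setosa"] * 5 + ["Iris-versicolor"] * 5,
})

clean = to_numeric_column(toy)                    # question 1
clean = replace_outliers(clean, "SepalLengthCm")  # question 7
print(clean.shape)                                # question 2: (rows, columns)
print(class_balance(clean))                       # question 3
print(pearson(clean).round(2))                    # question 10
```

Calling `draw_plots(df)` before and after `replace_outliers` would cover the two pairplot requests (questions 4 and 8). If the assignment expects rows with unparseable values to be dropped rather than kept as NaN, adding `dropna()` after the conversion would change the shape reported for question 2. For question 11, the same helpers could be run on the clean iris.csv and the outputs compared.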
