Question: ` ` ` In [ ] : import pandas as pd import matplotlib.pyplot as plt data = pd . read _ csv ( .

` ` `

[]

: import pandas as pd

import matplotlib.pyplot as plt

data

=

.

read

_

csv

(" . /

alleghenyCensusTractIncome

_

processed.csv

")

data

=

data

[

data

["

Type

"] = =

"Households"

] [["

Census Tract","Mean income

(

dollars

) "]]

("

Number of Census Tracts:

%

" %

len

(

data

))

data

["

Mean income

(

dollars

) "] .

hist

()

plt

.

xlabel

("

Avg

.

Annual Household Income

(

) ",

fontsize

= 15)

plt

.

ylabel

("

Number of Census Tracts",fontsize

= 15)

data.head

()

` ` `

If I take a random sample of

50

Census Tracts, what is the probability that the sample's expected value falls between

\ (\

100, 000 \)

and

\ (\

110, 0000 \) ?

Previously, we answered this kind of question by referring to the following plot of the normal distribution. We will discuss how to calculate these probabilities for any interval on the distribution in a future module. But, for now, know that the area under the curve is representing the probability that the sample mean falls within an interval. For example, the probability that the sample mean is between the population mean and

1

standard error above the mean

(

.

.,

between

0

and

\ (1 \

sigma

\)

in the plot

)

\ (34.1 \ % \) .

For now, I want you to use the Scipy Python package to answer this question with an exact probability. Scipy is a Python package designed for scientific computing and it contains many useful functions for statistics and machine learning. The "scipy.stats.norm" is a Python class for representing the Normal Distribution given an expected value and the standard deviation.

` ` `

[]

: # Here is an example using scipy.stats.norm to calculate the probability that the sample mean is below

2

given that the populatic

import scipy

popExpectedValue

= 3,

standardError

= 1

=

scipy.stats.norm

(

loc

= 3,

scale

= 1)

=

.

cdf

(2)

("

Probability the expected value of the sample is below

2

% 0.3

" %

)

` ` `

Use the cell below to answer the question about Allegheny County Census Tracts: If I take a random sample of Census Tracts, what is the probability that the sample's expected value falls between

\ (\

100, 000 \)

and

\ (\

110, 0000 \) ?

` ` `

[]

: import scipy

def calculate

_

income

_

distribution

(

data

)

# your code here

raise NotImplementedError

return #your code here.

calculate

_

income

_

distribution

(

data

)

` ` `

` ` ` In [ ] : import pandas as pd import

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

%% Python Jupitar Notebook - Please let me know if you need anything I have pasted the carsData.csv file data in bellow. Thank you%%% CarsData.csv...

import numpy as np import pandas as pd import matplotlib.pyplot as plt data = pd.read_csv("food-consumption.csv") food_items = data.columns[1:] def PCA(k, data): # Exclude non-numeric columns (e.g.,...

For this assignment you must work on a Google Colab notebook. If you haven't used it before, please sign in with your google account via this URL: Welcome To Colaboratory - Colaboratory_(google.com)...

################################################################################################ #Box plot code...

Different Approaches to Categorical Encoding There are multiple ways of handling Categorical variables. The two most widely used techniques: - Label Encoding - One-Hot Bncoding We will see here the...

Python - I am stuck/confused with the yellow highlights on this assignment. You can see my attempts below. For instance, with the header being infer, that means header = None? And how do I filter a...

Python - I am stuck with visualizing my data using matplotlib. Please see my attempt below and the error message. How do I load the file into df? and the index_col should be 0?!?! set it as null?...

***data.csv*** **lin_reg.py** import numpy as np import pandas as pd import matplotlib.pyplot as plt # function name: least_sq # inputs: file_name- name of the csv file # output: m(slope),...

1.Predict Mobile App Popularity mobile applications have truly revolutionized the way products and service are used. Businesses have started to realize the potential of having an app and they have...

UFO Python Download the ufo _ sightings.csv file that is located on this web page to the same directory where your Program 6 python file is located. Use pandas and / or matplotlib to produce two...

When screening prospective new ventures, venture capital firms must consider the nature of the proposed industry. Which of the following is not part of the screening of the proposed industry? a....

Between 1865 and 1890, other possible structures were proposed for benzene, two of which are shown here: Considering what nineteenth-century chemists knew about benzene, which is a better proposal...

5 6 % gat.ethics.ets.org Test for The Professional Educator in Georgia Question 5 of 1 2 One day during exam period, one of Ms . Saito's students conflides to her that another student is selling...

External auditors perform an important role in ensuring adherence to corporate governance principles through the protection of shareholder interests. Critically evaluate the role of the external...