Question: Here is code so far, but not getting correct visual. Dotted red lines should not be straight. Showed image of what correct visual should look

Here is code so far, but not getting correct visual. Dotted red lines should not be straight. Showed image of what correct visual should look like.

import pandas as pd

import numpy as np

import matplotlib.pyplot as plt

import scipy.stats as stats

fig,axes

=

plt

.

subplots

(3, 2,

figsize

= (15, 10))

for i

,

course in enumerate

(

courses

)

data

=

[

course

] .

dropna

()

mean,std

=

data.mean

(),

data.std

()

=

axes

[

/ / 2,

% 2]

stats.probplot

(

data

,

dist

=

"norm", plot

=

)

.

set

_

title

(

" {

course

.

replace

('_

grade',

'')},

= {

len

(

data

)} ")

.

legend

([

" {

course

.

replace

('_

grade',

'')},

= {

len

(

data

)} "])

.

axhline

(

mean

+ 2 *

std

,

color

=

'red', linestyle

=' - -')

.

axhline

(

mean

- 2 *

std

,

color

=

'red', linestyle

=' - -')

.

set

_

xlabel

("

Theoretical Quantiles"

)

.

set

_

ylabel

("

Ordered Values"

)

plt

.

tight

_

layout

()

plt

.

show

()

Question

2

: Grade Distribution Normality Check

(35 \ %)

Seeing the student grade distributions of the

6

large residential courses, the team is tempted to draft recommendations for instructors and report to them what particular aspects could be addressed to improve students' academic learning outcome. However, before they launch statistical tests, they need to verify if the student grades

data approximately follows normal distribution, a sufficient condition rendering the design of statistical models valid for those courses. You suggest that a QQ

-

plot is a great method to determine how similar a distribution is to another. Great idea!

-

Make a

3 * 2

figure

(

again

, 6

subplots

)

so that for each course you have a QQ plot using the student grade samples versus the normal distribution with the same mean and standard deviation

-

You need to use a legend on each plot to specify the corresponding course name and number of students involved. For example, you can draw a legend and specify "STATS

250,

= 5000 "

to indicate that you are analyzing STATS

250

course with

5000

enrolled students records being used for analysis

-

For each QQ

-

plot, add

2

lines representing

+ / - 2

standard deviations outside from the QQ

-

line

(

a straight line showing the theoretical values for different quantiles under normal distribution

) .

Use an additional annotation inside each subplot to highlight the outliers that sit outside of these lines. I.e

.

data points that lie outside the

2

standard deviations on either side. Briefly describe the figure discussing the courses and whether they seem to be normally distributed.

Hint: You may find using fig

=

plt

.

figure

()

and fig.add

_

subplot

()

functions helpful to create subplots. You don't have to use these functions though.

ENGLISH

125,

= 14196

English

125

= 14196

Here is code so far, but not getting correct

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

1. Read the article \"How to Display Data Badly,\" by Howard Wainer listed above. It was published in the American Statistician in 1984, volume 38, pages 137-147. It is a quick, informative, and...

FINAL TAKEHOME EXAMINATION Papers are due before 11:59p.m. on Friday June 27th. Papers will be submitted via a Turnitin link provided on eClass. USING PROPER ESSAY FORMAT, PLEASE ANSWER ONE OF THE...

the bottom of the main loop (after getting user input), increment the current player. Then, if the number is too high, reset it to 0. Before printing whose turn it is, print the board using one of...

Please scan the SEC Plain English that I've attached. Please visit to this link.http://www.sec.gov/Archives/edgar/data/320193/000119312513416534/d590790d10k.htm#toc590790_9 Please read pages 25...

Using the Visual C++ Debugger Acknowledgements: This document is modified from a Web page named CSE/ENGR 142 Debugger Tips written by staff at the University of Washington's Computer Science...

Read Chapter 1: A Burger, Fries, and a Side of Improv from the Applied Improvisation textbook (see Syllabus on Canvas for complete textbook information) and write in your learning journal on Google...

C HAP TER 1 Culturally Intelligent Leadership Matters The rst time I taught cultural intelligence principles to a group of executives in Minnesota, I miscalculated the time and distance it would take...

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

Rev.Confirming Pages C H A P T E R 7 Planning, Composing, and Revising Chapter Outline The Ways Good Writers Write Activities in the Composing Process Using Your Time Effectively Brainstorming,...

Assignment #1: Class Associations & Interfaces Due Date: Monday, February 22nd at 11:59 PM (2 weeks) Introduction This semester we will study object-oriented graphics programming and design by...

1. But what about Rifles statement, I want to play. Im your man, Mr. Greenback? Doesnt that indicate a binding agreement? 2. Does the court believe that the uncle did in fact benefit from the bargain?

When it is necessary to communicate corrective feedback, what four guidelines should be applied?

In cost utility analysis, outcomes need to be converted to some monetary unit ( dollars , e . g . ) to complete the analysis true or false

Determine the area of the shaded region in square units 3 5 39 2 5 2 1 5 1 0 5 1 square units 2 3 4 5 1 6 7