Question no 1: Use Excel and the hepatitis dataset. Answer the following questions: (1+1+1+1+1+2=7)

a. Probability of a Male patient being dead.

b. One patient has the value "?" for the ANOREXIA attribute. What is the most likely value of this attribute for this patient?

c. What is the probability that a patient between the ages of 10 and 50 uses steroids? (Replace "?" with Yes.)

d. Which is more likely: a person with no ANTIVIRALS being alive, or a person with MALAISE being dead?

e. Which age group is more likely to be dead? What are the probabilities? (Group the ages into 3 groups: 20-40, 40-60, 60-80.)

f. Is the age attribute normally distributed? Explain why or why not.

Question no 2: Use Excel/Python and the hepatitis dataset: (3+2=5)

  1. Create 3 different visualizations showing the mean and standard deviation (or standard error, as it is called in this context) of the sampling distributions of the sample mean age for sample sizes 2, 5, and 10.
  2. What happens to the mean of the sample means of age as the sample size is increased? What happens to the standard error?
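
The behavior asked about in Question 2 can be explored with a small simulation. What follows is a minimal sketch, not a definitive solution. It assumes the ages have already been loaded into a plain Python list and that samples are drawn with replacement. The `ages` values are placeholder numbers for illustration, not the hepatitis data, and `sample_mean_summary` is a hypothetical helper name.

```python
import random
import statistics


def sample_mean_summary(values, sample_size, draws=2000, seed=0):
    """Return (mean of sample means, standard error) for one sample size.

    Assumption: samples are drawn with replacement from `values`.
    """
    rng = random.Random(seed)
    means = [
        statistics.fmean(rng.choices(values, k=sample_size))
        for _ in range(draws)
    ]
    # The standard deviation of the sample means is the empirical standard error.
    return statistics.fmean(means), statistics.stdev(means)


# Placeholder ages for illustration only; replace with the real AGE column.
ages = [23, 30, 31, 34, 36, 38, 39, 41, 45, 47, 50, 51, 54, 58, 62, 66, 72]

for n in (2, 5, 10):
    mean_of_means, std_error = sample_mean_summary(ages, n)
    print(f"n={n:2d}  mean of sample means={mean_of_means:.2f}  SE={std_error:.2f}")
```

In general, the mean of the sample means stays close to the population mean for every sample size, while the standard error shrinks roughly in proportion to 1/sqrt(n). The list of sample means built inside the helper is also what a histogram for each sample size would be drawn from.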

Question no 3: Use Python (1+2+2=5)

a. Generate a discrete uniform distribution of population size 100 over the interval (1,10).

b. Consider a sample size of N=10. Simulate the sampling distribution of the sample mean.

c. Consider a sample size of N=30. What are the sample mean and sample standard deviation?
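
One way to approach the three parts of Question 3 is sketched below, using only the Python standard library. Several choices are assumptions rather than part of the question: the interval (1,10) is read as the integers 1 through 10 inclusive, samples are drawn without replacement, 1000 repeated samples are used for the simulation, and the seed is fixed so the run is repeatable. The name `simulate_sample_means` is illustrative.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the run is repeatable (an assumption)

# (a) Population of 100 values from a discrete uniform distribution on 1..10.
# Assumption: the interval (1,10) is read as inclusive of both endpoints.
population = [rng.randint(1, 10) for _ in range(100)]


def simulate_sample_means(pop, sample_size, draws=1000):
    """Means of `draws` samples taken without replacement from `pop`."""
    return [statistics.fmean(rng.sample(pop, sample_size)) for _ in range(draws)]


# (b) Sampling distribution of the sample mean for N = 10.
means_n10 = simulate_sample_means(population, 10)
print("N=10: mean of sample means =", round(statistics.fmean(means_n10), 3))
print("N=10: standard error       =", round(statistics.stdev(means_n10), 3))

# (c) One sample of size N = 30: its mean and sample standard deviation.
sample_n30 = rng.sample(population, 30)
print("N=30: sample mean =", round(statistics.fmean(sample_n30), 3))
print("N=30: sample std  =", round(statistics.stdev(sample_n30), 3))
```

`statistics.stdev` uses the n-1 denominator, which matches the usual definition of the sample standard deviation. The list `means_n10` can be passed to any plotting tool to draw the histogram of the sampling distribution.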

https://pastebin.com/jqSQYqhj
