Cancer deaths: Suppose for a set of counties i = {1,...,n} we have infor- mation on...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Cancer deaths: Suppose for a set of counties i = {1,...,n} we have infor- mation on the population size Xį number of people in 10,000s, and Y₁ number of cancer fatalities. One model for the distribution of cancer fa- = = talities is that, given the cancer rate 0, they are independently distributed with Y; Poisson (0X;). a) Identify the posterior distribution of given data (Y₁, X₁),..., (Yn, Xn) and a gamma(a, b) prior distribution. The file cancer_react.dat contains 1990 population sizes (in 10,000s) and number of cancer fatalities for 10 counties in a Midwestern state that are near nuclear reactors. The file cancer_noreact.dat contains the same data on counties in the same state that are not near nuclear reactors. Consider these data as samples from two populations of counties: one is the population of counties with no neighboring reactors and a fatality rate of ₁ deaths per 10,000, and the other is a population of counties having nearby reactors and a fatality rate of 02. In this exercise we will model beliefs about the rates as independent and such that 0₁ gamma (a₁, b₁) and 0₂ gamma(a2, b₂). 02 b) Using the numerical values of the data, identify the posterior distri- butions for ₁ and 2 for any values of (a1, b1, a2, b2). c) Suppose cancer rates from previous years have been roughly 0 = 2.2 per 10,000 (and note that most counties are not near reactors). For each of the following three prior opinions, compute E[0₁|data], E[02/data], 95% quantile-based posterior intervals for 0₁ and 02, and Pr(02 > 01|data). Also plot the posterior densities (try to put p(0₁|data) and p(02|data) on the same plot). Comment on the differences across posterior opinions. = i. Opinion 1: (a₁ =a2 = 2.2 x 100, b₁ b2 = 100). Cancer rates for both types of counties are similar to the average rates across all counties from previous years. 100, a2 ii. Opinion 2: (a₁ = 2.2 × 100, b₁ 2.2, b₁ = 1). Cancer rates in this year for nonreactor counties are similar to rates in previous years in nonreactor counties. We don't have much in- formation on reactor counties, but perhaps the rates are close to those observed previously in nonreactor counties. = iii. Opinion 3: (a₁ =a2 = 2.2, b₁ b2 = 1). Cancer rates in this year could be different from rates in previous years, for both reactor and nonreactor counties. = = d) In the above analysis we assumed that population size gives no infor- mation about fatality rate. Is this reasonable? How would the analysis have to change if this is not reasonable? e) We encoded our beliefs about ₁ and 2 such that they gave no in- formation about each other (they were a priori independent). Think about why and how you might encode beliefs such that they were a priori dependent. Cancer deaths: Suppose for a set of counties i = {1,...,n} we have infor- mation on the population size Xį number of people in 10,000s, and Y₁ number of cancer fatalities. One model for the distribution of cancer fa- = = talities is that, given the cancer rate 0, they are independently distributed with Y; Poisson (0X;). a) Identify the posterior distribution of given data (Y₁, X₁),..., (Yn, Xn) and a gamma(a, b) prior distribution. The file cancer_react.dat contains 1990 population sizes (in 10,000s) and number of cancer fatalities for 10 counties in a Midwestern state that are near nuclear reactors. The file cancer_noreact.dat contains the same data on counties in the same state that are not near nuclear reactors. Consider these data as samples from two populations of counties: one is the population of counties with no neighboring reactors and a fatality rate of ₁ deaths per 10,000, and the other is a population of counties having nearby reactors and a fatality rate of 02. In this exercise we will model beliefs about the rates as independent and such that 0₁ gamma (a₁, b₁) and 0₂ gamma(a2, b₂). 02 b) Using the numerical values of the data, identify the posterior distri- butions for ₁ and 2 for any values of (a1, b1, a2, b2). c) Suppose cancer rates from previous years have been roughly 0 = 2.2 per 10,000 (and note that most counties are not near reactors). For each of the following three prior opinions, compute E[0₁|data], E[02/data], 95% quantile-based posterior intervals for 0₁ and 02, and Pr(02 > 01|data). Also plot the posterior densities (try to put p(0₁|data) and p(02|data) on the same plot). Comment on the differences across posterior opinions. = i. Opinion 1: (a₁ =a2 = 2.2 x 100, b₁ b2 = 100). Cancer rates for both types of counties are similar to the average rates across all counties from previous years. 100, a2 ii. Opinion 2: (a₁ = 2.2 × 100, b₁ 2.2, b₁ = 1). Cancer rates in this year for nonreactor counties are similar to rates in previous years in nonreactor counties. We don't have much in- formation on reactor counties, but perhaps the rates are close to those observed previously in nonreactor counties. = iii. Opinion 3: (a₁ =a2 = 2.2, b₁ b2 = 1). Cancer rates in this year could be different from rates in previous years, for both reactor and nonreactor counties. = = d) In the above analysis we assumed that population size gives no infor- mation about fatality rate. Is this reasonable? How would the analysis have to change if this is not reasonable? e) We encoded our beliefs about ₁ and 2 such that they gave no in- formation about each other (they were a priori independent). Think about why and how you might encode beliefs such that they were a priori dependent.
Expert Answer:
Answer rating: 100% (QA)
2 2 a To identify the posterior distribution of given data 11Y1X1YnXn and a gamma ab prior distribut... View the full answer
Related Book For
Posted Date:
Students also viewed these accounting questions
-
The U.S. Census Bureau publishes information on the population of the United States in Current Population Reports. The following table gives the resident U.S. population, in millions of persons, for...
-
1. The American Management Association wishes to have information on the mean income of store managers in the retail industry. A random sample of 256 managers reveals a sample mean of $45,420. the...
-
The regression equation is computed for a set of n = 18 pairs of X and Y values with a correlation of r = 180 and SSY = 100. a. Find the standard error of estimate for the regression equation. b. How...
-
Big bang Pty (Ltd) is a company which is involved in sale of manufacturing goods. The company had the following income and expenses for the year of assessment: R15 000 for the collection of debts...
-
Using the same four main actors as in question 7-14, this time show with the arrows how the material (the water bottles and cages) moves.
-
Using Eq. (1.5), determine the ac resistance at a current of 1 mA and 15 mA for the diode of Fig. 1.15. Modify the equation as necessary for low levels of diode current. Compare to the solutions...
-
List key principles and attributes of the value management (VM) approach.
-
A 0.06-lb bullet is fired with a horizontal velocity into the lower end of a 45-lb slender bar which is initially at rest in a vertical plane. Knowing that the bullet becomes embedded in the bar and...
-
They would operate because they make a profit of 500. c) if you decide to shut down, explain how your loss would be minimized. (3 points) A B C Total revenue: 2,000 5,000 5,000 Total cost: 1,500...
-
FeVer Beverages is a typical coffee shop that operates at Batangas. Recently, the general manager is facing a challenge about not meeting their target lead time of serving the hot coffee of 15...
-
B. For the Process flow shown below, identify both the capacity of the overall process and identify the bottleneck process operation. First, calculate capacities for each process step. Cycle times...
-
Temperature of the water used: Corrected temperature: TABLE 1.6 mass of the flask + water (g) mass of the flask (g) mass of ~10 mL of water (g) volume delivered by pipet (mL) (show calculations) mean...
-
Solve A = =(a+b)h for h; when A = 205.2 m, a = 16.50 m, b = 19.50 m
-
5. A rigid square sheet of glass of side 1 metre is supported at its corners on four identical stiff linear-elastic springs. A 100 N vertical force is applied to the plate at the location shown in...
-
Decide if the following substances are elements, compounds, or mixtures. Au A: Element B: Compound OC: Mixture copper A: Element OB: Compound OC: Mixture salt and pepper A: Element B: Compound OC:...
-
A boundary layer grows on the wall of a wind tunnel test section, downstream of some flow straightening honeycomb. The boundary layer that develops on the wall is measured to be laminar, and of...
-
1. Let f(x)=- 16 x+2 and find the equation of the tangent line when x = 2
-
Chloroplasts are illuminated until the levels of the Calvin cycle intermediates reach a steady state. The light is then turned off. How does the level of RuBP vary after this point?
-
Data on salaries in the public school system are published annually in National Survey of Salaries and Wages in Public Schools by the Education Research Service. The mean annual salary of (public)...
-
The U.S. Census Bureau publishes data on the population of the United States by race and Hispanic origin in American Community Survey. From that document, we constructed the following bar chart. Note...
-
a. Determine whether it slopes upward, slopes downward, or is horizontal, without graphing the equation. b. Find its equation. c. Use two points to graph the equation. 1.5 and 0
-
There is a vent with an area 0.4 m2 on the wall of the air supply duct. Try to find the average velocity v of the outlet flow.
-
How do differences in income levels and income distribution among countries affect international businesses?
-
Describe the U.S. role in the world economy.
Study smarter with the SolutionInn App