Cancer deaths: Suppose for a set of counties i = {1,...,n} we have infor- mation on...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Cancer deaths: Suppose for a set of counties i = {1,...,n} we have infor- mation on the population size Xį number of people in 10,000s, and Y₁ number of cancer fatalities. One model for the distribution of cancer fa- = = talities is that, given the cancer rate 0, they are independently distributed with Y; Poisson (0X;). a) Identify the posterior distribution of given data (Y₁, X₁),..., (Yn, Xn) and a gamma(a, b) prior distribution. The file cancer_react.dat contains 1990 population sizes (in 10,000s) and number of cancer fatalities for 10 counties in a Midwestern state that are near nuclear reactors. The file cancer_noreact.dat contains the same data on counties in the same state that are not near nuclear reactors. Consider these data as samples from two populations of counties: one is the population of counties with no neighboring reactors and a fatality rate of ₁ deaths per 10,000, and the other is a population of counties having nearby reactors and a fatality rate of 02. In this exercise we will model beliefs about the rates as independent and such that 0₁ gamma (a₁, b₁) and 0₂ gamma(a2, b₂). 02 b) Using the numerical values of the data, identify the posterior distri- butions for ₁ and 2 for any values of (a1, b1, a2, b2). c) Suppose cancer rates from previous years have been roughly 0 = 2.2 per 10,000 (and note that most counties are not near reactors). For each of the following three prior opinions, compute E[0₁|data], E[02/data], 95% quantile-based posterior intervals for 0₁ and 02, and Pr(02 > 01|data). Also plot the posterior densities (try to put p(0₁|data) and p(02|data) on the same plot). Comment on the differences across posterior opinions. = i. Opinion 1: (a₁ =a2 = 2.2 x 100, b₁ b2 = 100). Cancer rates for both types of counties are similar to the average rates across all counties from previous years. 100, a2 ii. Opinion 2: (a₁ = 2.2 × 100, b₁ 2.2, b₁ = 1). Cancer rates in this year for nonreactor counties are similar to rates in previous years in nonreactor counties. We don't have much in- formation on reactor counties, but perhaps the rates are close to those observed previously in nonreactor counties. = iii. Opinion 3: (a₁ =a2 = 2.2, b₁ b2 = 1). Cancer rates in this year could be different from rates in previous years, for both reactor and nonreactor counties. = = d) In the above analysis we assumed that population size gives no infor- mation about fatality rate. Is this reasonable? How would the analysis have to change if this is not reasonable? e) We encoded our beliefs about ₁ and 2 such that they gave no in- formation about each other (they were a priori independent). Think about why and how you might encode beliefs such that they were a priori dependent. Cancer deaths: Suppose for a set of counties i = {1,...,n} we have infor- mation on the population size Xį number of people in 10,000s, and Y₁ number of cancer fatalities. One model for the distribution of cancer fa- = = talities is that, given the cancer rate 0, they are independently distributed with Y; Poisson (0X;). a) Identify the posterior distribution of given data (Y₁, X₁),..., (Yn, Xn) and a gamma(a, b) prior distribution. The file cancer_react.dat contains 1990 population sizes (in 10,000s) and number of cancer fatalities for 10 counties in a Midwestern state that are near nuclear reactors. The file cancer_noreact.dat contains the same data on counties in the same state that are not near nuclear reactors. Consider these data as samples from two populations of counties: one is the population of counties with no neighboring reactors and a fatality rate of ₁ deaths per 10,000, and the other is a population of counties having nearby reactors and a fatality rate of 02. In this exercise we will model beliefs about the rates as independent and such that 0₁ gamma (a₁, b₁) and 0₂ gamma(a2, b₂). 02 b) Using the numerical values of the data, identify the posterior distri- butions for ₁ and 2 for any values of (a1, b1, a2, b2). c) Suppose cancer rates from previous years have been roughly 0 = 2.2 per 10,000 (and note that most counties are not near reactors). For each of the following three prior opinions, compute E[0₁|data], E[02/data], 95% quantile-based posterior intervals for 0₁ and 02, and Pr(02 > 01|data). Also plot the posterior densities (try to put p(0₁|data) and p(02|data) on the same plot). Comment on the differences across posterior opinions. = i. Opinion 1: (a₁ =a2 = 2.2 x 100, b₁ b2 = 100). Cancer rates for both types of counties are similar to the average rates across all counties from previous years. 100, a2 ii. Opinion 2: (a₁ = 2.2 × 100, b₁ 2.2, b₁ = 1). Cancer rates in this year for nonreactor counties are similar to rates in previous years in nonreactor counties. We don't have much in- formation on reactor counties, but perhaps the rates are close to those observed previously in nonreactor counties. = iii. Opinion 3: (a₁ =a2 = 2.2, b₁ b2 = 1). Cancer rates in this year could be different from rates in previous years, for both reactor and nonreactor counties. = = d) In the above analysis we assumed that population size gives no infor- mation about fatality rate. Is this reasonable? How would the analysis have to change if this is not reasonable? e) We encoded our beliefs about ₁ and 2 such that they gave no in- formation about each other (they were a priori independent). Think about why and how you might encode beliefs such that they were a priori dependent.
Expert Answer:
Answer rating: 100% (QA)
2 2 a To identify the posterior distribution of given data 11Y1X1YnXn and a gamma ab prior distribut... View the full answer
Related Book For
Posted Date:
Students also viewed these accounting questions
-
The U.S. Census Bureau publishes information on the population of the United States in Current Population Reports. The following table gives the resident U.S. population, in millions of persons, for...
-
1. The American Management Association wishes to have information on the mean income of store managers in the retail industry. A random sample of 256 managers reveals a sample mean of $45,420. the...
-
The regression equation is computed for a set of n = 18 pairs of X and Y values with a correlation of r = 180 and SSY = 100. a. Find the standard error of estimate for the regression equation. b. How...
-
Big bang Pty (Ltd) is a company which is involved in sale of manufacturing goods. The company had the following income and expenses for the year of assessment: R15 000 for the collection of debts...
-
Does the use of inhaled steroids by children affect their height as adults? Excerpts from the abstract of a study about this are given. Read them and then answer the questions that follow. "Methods:...
-
What is Laplace's law, Eq. 18.51, for a soap bubble? 2y Pin - Pout (spherical surface). (18.51) R
-
Prepare journal entiies to record the following merchandising transactions of Wave Company, which applies the perpetual inventory system. July 3 Purchased merchandise from CAP Corp. for $15,000 under...
-
MacDonald Industries completed the following transactions during 2016: Nov. 1 Made sales of $32,000. MacDonald estimates that warranty expense is 6% of sales. (Record only the warranty expense.) 20...
-
Suresh Company reports the following segment (department) income results for the year. Department M Department N Department O Department P Department T Total Sales $ 68,000 $ 38,000 $ 65,000 $ 47,000...
-
Refer to Figure 11.45. A square footing, 2 x 2 m in size, supports a column load of 300 kN. The soil characteristics are given in the figure. Field monitoring indicated that the foundation settlement...
-
Prepare the necessary journal entries to record the following transactions relating to the long-term issuance of bonds of Pitts Co. (Credit account titles are automatically indented when the amount...
-
The DBMS acts as an interface between what two components of an enterprise-class database system? give reasons
-
Four independent situations are described below. Each involves future deductible amounts and/or future taxable amounts produced by temporary differences: Taxable income Future deductible amounts...
-
An investor is presented with a choice of two investments: an established clothing store and a new computer store. Each choice requires the same initial investment and each produces a continuous...
-
Why is it important to have an ethical organization? What benefits are there to having an ethical organization? What happens to the organization when individuals within the company act unethically?...
-
In 300 words or less, what type of organizational culture do you work in? Professional and customer basedDescribe the characteristics of senior leadership in your culture. How does leadership...
-
Waterway/Neff, Ltd. personalizes scrapbooks for customers, using their digital photographs. Each scrapbook sells for $35. Waterway/Neff collects cash at the time of sale from 30% of customers. Of the...
-
Chloroplasts are illuminated until the levels of the Calvin cycle intermediates reach a steady state. The light is then turned off. How does the level of RuBP vary after this point?
-
Data on salaries in the public school system are published annually in National Survey of Salaries and Wages in Public Schools by the Education Research Service. The mean annual salary of (public)...
-
The U.S. Census Bureau publishes data on the population of the United States by race and Hispanic origin in American Community Survey. From that document, we constructed the following bar chart. Note...
-
a. Determine whether it slopes upward, slopes downward, or is horizontal, without graphing the equation. b. Find its equation. c. Use two points to graph the equation. 1.5 and 0
-
Distinguish among the three types of responsibility
-
Using the information in E7-1, assume that in July 2002, Voss Company incurs the following manufacturing overhead costs. Instructions (a) Prepare a flexible budget performance report, assuming that...
-
Samano Company uses flexible budgets to control its selling expenses. Monthly sales are expected to range from \($170,000\) to \($200,000\). Variable costs and their percentage relationship to sales...
Study smarter with the SolutionInn App