Question:

In Example 16.15, we considered a count data model for the number of doctor visits by an individual as a function of a few explanatory variables. In this exercise, we expand the analysis using a larger data set, in the data file rwm88, and more explanatory variables. Adjust the data in the following ways: (i) omit individuals for whom $HHNINC2=0$; (ii) create the variable $LINC=\ln(HHNINC2)$; (iii) create $AGE2=AGE^{2}$; (iv) create the variable $POST=1$ (a postsecondary degree indicator variable) if $FACHHS=1$ or if $UNIV=1$, and $POST=0$ otherwise.
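The four data adjustments can be sketched as follows. This is a minimal illustration using only the Python standard library; the `prepare` helper and the three sample records are hypothetical, and loading the actual rwm88 file is left out because its format is not stated here.

```python
import math

def prepare(rows):
    """Apply adjustments (i)-(iv) to a list of dict records (hypothetical helper)."""
    prepared = []
    for row in rows:
        # (i) omit individuals with zero household income
        if row["HHNINC2"] == 0:
            continue
        new = dict(row)
        # (ii) log of household income
        new["LINC"] = math.log(row["HHNINC2"])
        # (iii) age squared
        new["AGE2"] = row["AGE"] ** 2
        # (iv) postsecondary degree indicator
        new["POST"] = 1 if (row["FACHHS"] == 1 or row["UNIV"] == 1) else 0
        prepared.append(new)
    return prepared

# Hypothetical records, not taken from the data file
sample = [
    {"HHNINC2": 0.0, "AGE": 40, "FACHHS": 0, "UNIV": 0},
    {"HHNINC2": 2500.0, "AGE": 30, "FACHHS": 1, "UNIV": 0},
    {"HHNINC2": 1800.0, "AGE": 55, "FACHHS": 0, "UNIV": 0},
]
cleaned = prepare(sample)
```

Dropping the zero-income rows first matters because $\ln(0)$ is undefined.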

a. Using the first 3000 observations, estimate a Poisson model explaining DOCVIS as a function of FEMALE, AGE, AGE2, SELF, LINC, POST, and PUBLIC. Discuss the signs and the significance of the coefficients on FEMALE, SELF, POST, and PUBLIC. Calculate the percentage increase in the expected number of doctor visits for each factor represented by these indicator variables.
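For the percentage calculation in part (a), one common approach relies on the exponential conditional mean of the Poisson model: switching an indicator from 0 to 1 multiplies the expected count by $\exp(\beta)$, so the percentage change is $100(\exp(\beta)-1)$. The sketch below assumes that approach; the coefficient values are placeholders, not estimates from the data.

```python
import math

def indicator_pct_change(coef):
    """Percentage change in the expected count when an indicator goes from 0 to 1."""
    return 100.0 * (math.exp(coef) - 1.0)

# Placeholder coefficients for illustration only (NOT estimates from rwm88)
hypothetical_coefs = {"FEMALE": 0.10, "SELF": -0.20, "POST": -0.05, "PUBLIC": 0.15}

for name, b in hypothetical_coefs.items():
    print(f"{name}: {indicator_pct_change(b):+.2f}%")
```

A negative coefficient gives a negative percentage change, that is, fewer expected visits for the group coded 1.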

b. Compute the estimated percentage change in the expected number of doctor visits associated with another year of age for a person who is 30 years old; who is 50 years old; and who is 70 years old.
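One way to approach part (b), sketched under the assumption that the conditional mean is exponential in the regressors and that $\beta_{AGE}$ and $\beta_{AGE2}$ (notation introduced here, not in the exercise) are the coefficients of AGE and AGE2:

```latex
% Derivative-based approximation of the percentage change per extra year of age
\frac{\partial \ln E(DOCVIS \mid \mathbf{x})}{\partial AGE}
  = \beta_{AGE} + 2\,\beta_{AGE2}\,AGE
\quad\Longrightarrow\quad
\%\Delta E(DOCVIS) \approx 100\left(\beta_{AGE} + 2\,\beta_{AGE2}\,AGE\right)

% Exact change from AGE to AGE + 1, using (AGE+1)^2 - AGE^2 = 2\,AGE + 1
\%\Delta E(DOCVIS)
  = 100\left[\exp\!\left(\beta_{AGE} + \beta_{AGE2}\,(2\,AGE + 1)\right) - 1\right]
```

Either expression is evaluated at $AGE=30$, $50$, and $70$ with the estimated coefficients substituted in.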

c. Interpret the estimated coefficient of LINC.
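For part (c), a short derivation may help. It assumes the same exponential conditional mean and writes $\beta_{LINC}$ (notation introduced here) for the coefficient of LINC:

```latex
% Because LINC = ln(HHNINC2), the coefficient of LINC is an elasticity
\ln E(DOCVIS \mid \mathbf{x}) = \cdots + \beta_{LINC}\,\ln(HHNINC2)
\quad\Longrightarrow\quad
\frac{\partial \ln E(DOCVIS \mid \mathbf{x})}{\partial \ln(HHNINC2)} = \beta_{LINC}
```

Under this reading, a 1% increase in household income is associated with an approximate $\beta_{LINC}$% change in the expected number of doctor visits, holding the other variables fixed.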

d. Calculate the expected number of doctor visits for each person, EDOCVIS, and round this value to the nearest integer to obtain NVISITS, the predicted number of visits for each person. Create a variable that indicates a successful prediction. Let $SUCCESS=1$ if $NVISITS=DOCVIS$ and $SUCCESS=0$ otherwise. What is the percentage of successful predictions for observations 1-3000? What is the percentage of successful predictions for the remaining 979 observations?
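The success rate in part (d) might be computed as in the sketch below. The expected values and counts are made-up numbers, and the half-up rounding helper is an assumption: Python's built-in `round` rounds halves to the nearest even integer, which may differ from what econometrics software does.

```python
import math

def round_half_up(x):
    """Round a non-negative value to the nearest integer, with halves rounded up."""
    return int(math.floor(x + 0.5))

def success_rate(expected, actual):
    """Percentage of observations where the rounded prediction equals the actual count."""
    hits = sum(1 for e, d in zip(expected, actual) if round_half_up(e) == d)
    return 100.0 * hits / len(actual)

# Hypothetical values for illustration only
edocvis = [2.816, 0.4, 1.5, 3.2]   # EDOCVIS: expected visits from a fitted model
docvis = [0, 0, 2, 5]              # DOCVIS: actual visits

rate = success_rate(edocvis, docvis)
```

With the data held in lists, the in-sample and hold-out rates would be obtained by applying `success_rate` to the slices `[:3000]` and `[3000:]` respectively.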

e. Create SUCCESS1, which indicates a successful prediction of more than one doctor visit. That is, create a variable $DOCVIS1=1$ if an individual has more than one doctor visit, and $PREDICT1=1$ if the model has predicted more than one doctor visit. Let $SUCCESS1=1$ if $DOCVIS1=PREDICT1$ and $SUCCESS1=0$ otherwise. What is the percentage of successful predictions of more than one doctor visit for observations 1-3000? What is the percentage of successful predictions of more than one doctor visit for the remaining 979 observations?
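A possible sketch of part (e) follows. It assumes that "predicted more than one doctor visit" means the rounded prediction NVISITS exceeds 1; the exercise could also be read as EDOCVIS exceeding 1, which would change one line. The data are made up.

```python
import math

def success1_rate(expected, actual):
    """Percentage of observations where DOCVIS1 equals PREDICT1."""
    hits = 0
    for e, d in zip(expected, actual):
        nvisits = int(math.floor(e + 0.5))     # half-up rounding (assumption)
        predict1 = 1 if nvisits > 1 else 0     # model predicts more than one visit
        docvis1 = 1 if d > 1 else 0            # individual had more than one visit
        hits += int(predict1 == docvis1)
    return 100.0 * hits / len(actual)

# Hypothetical values for illustration only
edocvis = [2.816, 0.4, 1.5, 3.2]
docvis = [0, 0, 2, 5]

rate1 = success1_rate(edocvis, docvis)
```

Note that SUCCESS1 counts a match when both indicators are 0 as well as when both are 1, so it is usually higher than the exact-match rate.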

Data from Example 16.15:

The economic analysis of the health care system is a vital area of research and public interest. In this example, we consider data used by Riphahn, Wambach, and Million (2003). The data file rwm88_small contains data on 1,200 individuals' number of doctor visits in the past three months (DOCVIS), their age in years (AGE), their sex (FEMALE), and whether or not they had public insurance (PUBLIC). The frequencies of doctor visits are illustrated in Table 16.6, with 90.5% of the sample having eight or fewer visits.

TABLE 16.6 Number of Doctor Visits (DOCVIS)

| DOCVIS | Number |
|---|---|
| 0 | 443 |
| 1 | 200 |
| 2 | 163 |
| 3 | 111 |
| 4 | 51 |
| 5 | 49 |
| 6 | 37 |
| 7 | 7 |
| 8 | 25 |

Applying maximum likelihood estimation, we obtain the fitted model

What can we say about these results? First, the coefficient estimates are all positive, implying that older individuals, females, and those with public health insurance will have more doctor visits. Second, the coefficients of AGE, FEMALE, and PUBLIC are significantly different from zero, with $p$-values less than 0.01. Using the fitted model, we can estimate the expected number of doctor visits. For example, the first person in the sample is a 29-year-old female who has public insurance. Substituting these values, we estimate her expected number of doctor visits to be 2.816, or 3.0 rounded to the nearest integer. Her actual number of doctor visits was zero.
Using the notion of a generalized $R^{2}$, we can get a sense of how well the model fits the data by computing the squared correlation between DOCVIS and the predicted number of visits. If we use the rounded values, for example, 3.0 instead of 3.33, the correlation is 0.1179, giving $R_{g}^{2}=(0.1179)^{2}=0.0139$. The fit for this simple model is not very good, as we might well expect. The model does not account for many important factors, such as income, general health status, and so on. Different software packages report many different values, sometimes called pseudo-$R^{2}$, with different meanings as well. We urge you to ignore all these values, including $R_{g}^{2}$.
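The generalized $R^{2}$ described here is simply a squared sample correlation. The sketch below implements it with the standard library; `generalized_r2` is a name chosen for this illustration, and the final lines only re-check the arithmetic reported in the text.

```python
import math

def generalized_r2(actual, predicted):
    """Squared sample correlation between actual and predicted values."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    r = cov / math.sqrt(var_a * var_p)
    return r * r

# Arithmetic check of the value reported in the text
reported_corr = 0.1179
reported_r2 = round(reported_corr ** 2, 4)
```

The function needs some variation in both series; a constant prediction would make the denominator zero.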
Instead of an $R^{2}$-like number, it is a good idea to report a test of overall model significance, analogous to the overall $F$-test for the regression model. The null hypothesis is that all the model coefficients, except the intercept, are equal to zero. We recommend the likelihood ratio statistic. See Section 16.2.7 for a discussion of this test in the context of the probit model. The test statistic is $LR=2\left(\ln L_{U}-\ln L_{R}\right)$, where $\ln L_{U}$ is the value of the log-likelihood function for the full, unrestricted model and $\ln L_{R}$ is the value of the log-likelihood function for the restricted model that assumes the null hypothesis is true. The restricted model in this case is $E(DOCVIS)=\exp\left(\gamma_{1}\right)$. If the null hypothesis is true, the $LR$ test statistic has a $\chi_{(3)}^{2}$-distribution in large samples. In our example, $LR=174.93$ and the 95th percentile of the $\chi_{(3)}^{2}$-distribution is 7.815. We reject the null hypothesis at the 5% level of significance, and we conclude that at least one variable has a significant effect on the number of doctor visits.
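The test decision above can be reproduced with a few lines of standard-library Python. The closed-form upper-tail probability used here holds only for a chi-square distribution with 3 degrees of freedom; the function names are illustrative, and the log-likelihood values themselves are not given in the text, so only the reported $LR$ is used.

```python
import math

def lr_statistic(loglik_unrestricted, loglik_restricted):
    """Likelihood ratio statistic LR = 2 (ln L_U - ln L_R)."""
    return 2.0 * (loglik_unrestricted - loglik_restricted)

def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variable with 3 degrees of freedom."""
    if x <= 0:
        return 1.0
    return math.erfc(math.sqrt(x / 2.0)) + math.sqrt(2.0 * x / math.pi) * math.exp(-x / 2.0)

LR = 174.93            # value reported in the text
CRITICAL_95 = 7.815    # 95th percentile of chi-square(3), as reported in the text

reject = LR > CRITICAL_95
p_value = chi2_sf_df3(LR)
```

Evaluating `chi2_sf_df3(7.815)` gives approximately 0.05, which matches the stated critical value.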
What about the magnitudes of the effects of these variables on the number of doctor visits? Treating AGE as continuous, we can use (16.29) to compute a marginal effect.


To evaluate this effect, we must insert values for AGE, FEMALE, and PUBLIC. Let $FEMALE=1$ and $PUBLIC=1$.

If $AGE=30$, the estimate is 0.0332, with the 95% interval estimate being [0.0261, 0.0402]. That is, we estimate that for a 30-year-old female with public insurance, an additional year of age will increase her expected number of doctor visits in a 3-month period by 0.0332. Because the marginal effect is a nonlinear function of the estimated parameters, the interval estimate uses a standard error calculated using the delta method. For $AGE=70$, it is 0.0528 [0.0355, 0.0702]. The effect of another year of age is greater for older individuals, as you would expect.
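The marginal effect of AGE used above can be written out for an exponential conditional mean. The indexing $\gamma_2$, $\gamma_3$, $\gamma_4$ for the coefficients of AGE, FEMALE, and PUBLIC is an assumption that extends the $\gamma_1$ intercept notation of the restricted model; the original equation is not reproduced in this copy.

```latex
% Marginal effect of AGE in a Poisson regression with exponential mean
\frac{\partial E(DOCVIS)}{\partial AGE}
  = \gamma_2 \exp\!\left(\gamma_1 + \gamma_2\,AGE + \gamma_3\,FEMALE + \gamma_4\,PUBLIC\right)
  = \gamma_2\,E(DOCVIS)

% Consistency check using the two reported estimates (FEMALE = PUBLIC = 1)
\frac{0.0528}{0.0332} \approx 1.59 = \exp\!\left(40\,\gamma_2\right)
\quad\Longrightarrow\quad
\gamma_2 \approx \frac{\ln(1.59)}{40} \approx 0.0116
```

The implied value of $\gamma_2$ is only a rough back-calculation from rounded numbers, not a reported estimate.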
Both FEMALE and PUBLIC are indicator variables, taking values zero and one. For these variables, we cannot evaluate the "marginal effect" using a derivative. Instead, we estimate the difference between the expected number of doctor visits for the two cases. For example,

$$\Delta E(DOCVIS)=E(DOCVIS \mid AGE, FEMALE=1, PUBLIC=1)-E(DOCVIS \mid AGE, FEMALE=1, PUBLIC=0)$$

The calculated value of the difference is

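$$\widehat{\Delta E}(DOCVIS)=\exp\left(\hat{\gamma}_{1}+\hat{\gamma}_{2} AGE+\hat{\gamma}_{3}+\hat{\gamma}_{4}\right)-\exp\left(\hat{\gamma}_{1}+\hat{\gamma}_{2} AGE+\hat{\gamma}_{3}\right)$$

where $\hat{\gamma}_{1}$ is the estimated intercept and $\hat{\gamma}_{2}$, $\hat{\gamma}_{3}$, and $\hat{\gamma}_{4}$ are the estimated coefficients of AGE, FEMALE, and PUBLIC.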

We estimate the difference for a 30-year-old female to be 1.24 [1.00, 1.48], and for a 70-year-old female, it is 1.98 [1.59, 2.36]. Women with public insurance visit the doctor significantly more than women of the same age who do not have public insurance.
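The indicator-variable comparison can be sketched in code as the difference between two predicted means. The coefficients below are placeholders chosen for illustration, not the estimates behind the reported values of 1.24 and 1.98, and the function names are hypothetical.

```python
import math

# Placeholder coefficients for illustration only (NOT the fitted values from the example)
GAMMA = {"const": 0.0, "AGE": 0.01, "FEMALE": 0.1, "PUBLIC": 0.5}

def expected_visits(age, female, public, g=GAMMA):
    """Poisson conditional mean: exp of the linear index."""
    index = g["const"] + g["AGE"] * age + g["FEMALE"] * female + g["PUBLIC"] * public
    return math.exp(index)

def public_difference(age, female, g=GAMMA):
    """Expected visits with public insurance minus expected visits without it."""
    return expected_visits(age, female, 1, g) - expected_visits(age, female, 0, g)

diff_30 = public_difference(30, 1)
diff_70 = public_difference(70, 1)
```

With a positive AGE coefficient, the difference grows with age by the factor $\exp(40 \times \text{AGE coefficient})$ between ages 30 and 70, which mirrors the pattern in the reported estimates.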

