# Question: Suppose you are trying to explain variations in salaries for

Suppose you are trying to explain variations in salaries for technicians in a particular field of work. The file P11_71.xlsx contains annual salaries for 200 technicians. It also shows how many years of experience each technician has, as well as his or her education level. There are four education levels, as explained in the comment in cell D1. Three suggestions are put forth for the relationship between Salary and these two explanatory variables:

• You should regress Salary linearly versus the two given variables, Yrs Exper and Educ Lev.

• All that really matters in terms of education is whether the person got a college degree or not. Therefore, you should regress Salary linearly versus Yrs Exper and a dummy variable indicating whether he or she got a college degree.

• Each level of education might result in different jumps in salary. Therefore, you should regress Salary linearly versus Yrs Exper and dummy variables for the different education levels.

a. Run the indicated regressions for each of these three suggestions. Then

(1) Explain what each equation is saying and how the three are different (focus here on the coefficients),

(2) Which you prefer, and

(3) Whether (or how) the regression results in your preferred equation contradict the average salary results shown in the Pivot Table sheet of the file.

b. Consider the four workers shown on the Prediction sheet of the file. Using your preferred equation, calculate a predicted salary and a 95% prediction interval for each of these four workers.

c. It turns out (you don’t have to check this) that the interaction between years of experience and education level is not significant for this data set. In general, however, argue why you might expect an interaction between them for salary data of technical workers. What form of interaction would you suspect?

• You should regress Salary linearly versus the two given variables, Yrs Exper and Educ Lev.

• All that really matters in terms of education is whether the person got a college degree or not. Therefore, you should regress Salary linearly versus Yrs Exper and a dummy variable indicating whether he or she got a college degree.

• Each level of education might result in different jumps in salary. Therefore, you should regress Salary linearly versus Yrs Exper and dummy variables for the different education levels.

a. Run the indicated regressions for each of these three suggestions. Then

(1) Explain what each equation is saying and how the three are different (focus here on the coefficients),

(2) Which you prefer, and

(3) Whether (or how) the regression results in your preferred equation contradict the average salary results shown in the Pivot Table sheet of the file.

b. Consider the four workers shown on the Prediction sheet of the file. Using your preferred equation, calculate a predicted salary and a 95% prediction interval for each of these four workers.

c. It turns out (you don’t have to check this) that the interaction between years of experience and education level is not significant for this data set. In general, however, argue why you might expect an interaction between them for salary data of technical workers. What form of interaction would you suspect?

## Relevant Questions

The file P03_55.xlsx contains baseball data on all MLB teams from during the years 2004–2009. For each year and team, the total salary and the number of (regular-season) wins are listed.a. Rearrange the data so that there ...The file P03_63.xlsx contains 2009 data on R&D expenses and many financial variables for 85 U.S. publicly traded companies in the computer and electronic product manufacturing industry. The question is whether R&D expenses ...The number of reported accidents at a manufacturing plant located in Flint, Michigan, was recorded at the start of each month. These data are provided in the file P12_05.xlsx. Is this time series random? Perform a runs test ...The file P12_13.xlsx contains quarterly data on GDP. a. Look at a time series plot of GDP. Does it suggest a linear relationship; an exponential relationship?b. Use regression to estimate a linear relationship between GDP ...The purpose of this problem is to get you used to the concept of autocorrelation in a time series. You could do this with any time series, but here you should use the series of Walmart daily stock prices in the file ...Post your question