# Question: The following exercise is from Introduction to Regression Modeling and

The following exercise is from Introduction to Regression Modeling and refers to data taken from Higgins and Koch’s “Variable Selection and Generalized Chi-Square Analysis of Categorical Data Applied to a Large Cross-Sectional Occupational Health Survey” [International Statistical Review (1977) 45: 51–62]. The data were taken from a large survey of workers in the cotton industry. The researchers wanted to study the factors that may be associated with brown lung disease resulting from inhaling particles of cotton, flax, hemp, or jute. The variables are as follows: number of workers suffering from disease (yes); number of workers not suffering from disease (no); dustiness of workplace (1 = high; 2 = medium; 3 = low); race (1 = white; 2 = other); sex (1 = male; 2 = female); smoking history (1 = smoker; 2 = nonsmoker); length of employment in cotton industry (1 = less than 10 years; 2 = between 10 and 20 years; 3 = more than 20 years).

a. List the five covariates from most likely to least likely to be associated with the probability that a cotton worker has brown lung disease.

b. Do there appear to be any interactions between the covariates?

c. Use a statistical software package to obtain a prediction model using all five covariates.
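
As a starting point for parts (a) and (b), the grouped counts can be summarized descriptively before any model is fitted. The sketch below is only an illustration under stated assumptions: the column names (`dust`, `race`, `sex`, `smoke`, `employ`) are hypothetical, the example rows are made-up placeholders rather than the Higgins and Koch counts, and ranking covariates by the spread of their marginal disease rates is a crude screening heuristic, not a formal test.

```python
from collections import defaultdict

# Hypothetical column names for the five coded covariates.
COVARIATES = ("dust", "race", "sex", "smoke", "employ")


def disease_rates(rows, *factors):
    """Proportion diseased for each combination of levels of `factors`,
    pooling the yes/no counts over all other covariates."""
    yes = defaultdict(int)
    total = defaultdict(int)
    for row in rows:
        key = tuple(row[f] for f in factors)
        yes[key] += row["yes"]
        total[key] += row["yes"] + row["no"]
    # Skip cells with no workers to avoid dividing by zero.
    return {k: yes[k] / total[k] for k in sorted(total) if total[k] > 0}


def rank_by_spread(rows, covariates=COVARIATES):
    """Order covariates by (max - min) of their marginal disease rates.
    A rough screening device only; it ignores sample size and confounding."""
    spread = {}
    for c in covariates:
        rates = disease_rates(rows, c).values()
        spread[c] = max(rates) - min(rates)
    return sorted(spread, key=spread.get, reverse=True)


# Placeholder rows, made up for illustration only.
# These are NOT the counts from the Higgins and Koch survey.
example_rows = [
    {"yes": 6, "no": 34, "dust": 1, "race": 1, "sex": 1, "smoke": 1, "employ": 3},
    {"yes": 2, "no": 38, "dust": 1, "race": 2, "sex": 2, "smoke": 2, "employ": 1},
    {"yes": 1, "no": 59, "dust": 3, "race": 1, "sex": 1, "smoke": 1, "employ": 2},
    {"yes": 1, "no": 59, "dust": 3, "race": 2, "sex": 2, "smoke": 2, "employ": 1},
]

# Part (a): marginal rates for each covariate and a rough ordering.
for name in COVARIATES:
    print(name, disease_rates(example_rows, name))
print("rough ordering:", rank_by_spread(example_rows))

# Part (b): a two-way table; if the smoking effect differs by dust level,
# that hints at a dust-by-smoking interaction worth checking in a model.
print(disease_rates(example_rows, "dust", "smoke"))
```

For part (c), the same rows would typically be passed to a package's logistic (binomial) regression routine with the yes/no counts as the grouped response and the five covariates entered as categorical factors; the descriptive tables above only suggest which effects and interactions to look for.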
