# Question

The table shows data from 27 automotive plants on y = number of assembly defects per 100 cars and x = time (in hours) to assemble each vehicle. The data are in the Quality and Productivity file on the text CD. Number of defects in assembling 100 cars and time to assemble each vehicle

Source: Data from S. Chatterjee, M. Handcock, and J. Simonoff, A Casebook for a First Course in Statistics and Data Analysis (Wiley, 1995); based on graph in The Machine that Changed the World, by J. Womack, D. Jones, and D. Roos (Macmillan, 1990).

a. The prediction equation is = 61.3 + 0.35x. Find the predicted number of defects for a car having assembly time (i) 12 hours (the minimum) and (ii) 54 hours (the maximum).

b. The first 11 plants were Japanese facilities and the rest were not. Let x1 = time to assemble vehicle and x2 = whether facility is Japanese (1 = yes, 0 = no). The fit of the multiple regression model is = 105.0 - 0.78x1 - 36.0x2. Interpret the coefficients that estimate the effect of x1 and the effect of x2.

c. Explain why part a and part b indicate that Simpson’s paradox has occurred.

d. Explain how Simpson’s paradox occurred. To do this, construct a scatterplot between y and x1 in which points are identified by whether the facility is Japanese. Note that the Japanese facilities tended to have low values for both x1 and x2.

Source: Data from S. Chatterjee, M. Handcock, and J. Simonoff, A Casebook for a First Course in Statistics and Data Analysis (Wiley, 1995); based on graph in The Machine that Changed the World, by J. Womack, D. Jones, and D. Roos (Macmillan, 1990).

a. The prediction equation is = 61.3 + 0.35x. Find the predicted number of defects for a car having assembly time (i) 12 hours (the minimum) and (ii) 54 hours (the maximum).

b. The first 11 plants were Japanese facilities and the rest were not. Let x1 = time to assemble vehicle and x2 = whether facility is Japanese (1 = yes, 0 = no). The fit of the multiple regression model is = 105.0 - 0.78x1 - 36.0x2. Interpret the coefficients that estimate the effect of x1 and the effect of x2.

c. Explain why part a and part b indicate that Simpson’s paradox has occurred.

d. Explain how Simpson’s paradox occurred. To do this, construct a scatterplot between y and x1 in which points are identified by whether the facility is Japanese. Note that the Japanese facilities tended to have low values for both x1 and x2.

## Answer to relevant Questions

A chain restaurant that specializes in selling hamburgers wants to analyze how y = sales for a customer (the total amount spent by a customer on food and drinks, in dollars) depends on the location of the restaurant, which ...Example 12 used logistic regression to estimate the probability of having a travel credit card when x = annual income (in thousands of euros). At the mean income of :25,000, show that the estimated probability of having a ...Refer to the previous two exercises. When the explanatory variables are x1 = family income, x2 = number of years of education, and x3 = gender (1 = male , 0 = female ), suppose a logistic regression reports For this sample, ...A MINITAB printout is provided from fitting the multiple regression model to U.S. crime data for the 50 states (excluding Washington, D.C.), on y = violent crime rate, x1 = poverty rate, and x2 = percent living in urban ...Refer to the FL Student Survey data file on the text CD. Using software, conduct a regression analysis using y = college GPA and predictors high school GPA and sports (number of weekly hours of physical exercise). Prepare a ...Post your question

0