Question: I dont know2 where to start on this problem please help 2.} The data set named HW 6.2 contains a random sample of 35 movies
I dont know2 where to start on this problem please help

2.} The data set named "HW 6.2" contains a random sample of 35 movies released in 2003 collected from the Internet Movie Database (IMDb}. The goal of this problem is to explore ifthe information available soon after a movie's theatrical release can successfully predict total revenue. All dollar amounts {i.e., variables "Budget", "Opening", and "USRevenue\") are measured in millions of dollars. a. Investigate the relationship between the explanatory variable \"Budget" and response variable "USRevenue" by doing the following: i. Make a scatterplot. ii. Calculate the correlation coefcient. iii. Interpret the scatterplot and correlation coefficient in terms of trend, strength, and shape. b. Repeat part (a) for the explanatory variable "Opening\". Repeat part (a) for the explanatory variable "Theaters\". d. Based on your findings in parts {a} through {c}, which of the three explanatory variables would be most appropriate for predicting the response variable "U5Revenue"? Justify your choice in a few sentences. e. For the "most appropriate" variable identied in part {d}, run a Simple Linear Regression analysis. Please include: i. The regression equation. ii. Interpret the slope of the regression line [in context of this data set). iii. Is it meaningful to interpret the yintercept? Why or why not? iv. State rsquared {i.e., the coefficient of determination} and explain what this P value means {in context of the data set}
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
