Question: Using RStudio please provide step by step with code Please use the data set below to answer the questions and refer the data set as
Using RStudio please provide step by step with code
Please use the data set below to answer the questions and refer the data set as movies
The data set may not match the answers to some of the problems, it is the R code to get the answer that I need help with
Data Set:
title originallanguage releasedate popularity votecount voteaverage budget revenue runtime collection year month comedy scifi fantasy action sequel number
Smile en EE
The Black Phone en E
Jeepers Creepers: Reborn en E
Nope en E
X en E
Dahmer en
Halloween Kills en E
Scream en EE
Orphan en E
Jeepers Creepers en
Which of the following is an assumption of the regression model?
check all that apply
A At least one independent variable is endogenous.
B Regression errors are independently and identically distributed same mean & variance
C Regression errors are normally distributed around the population mean of the dependent variable.
D Independent variables are collinear with each other.
What does the residual plot tell you about the regression? Do you spot any abnormalities?
Estimate Ramsay's RESET test on the regression. Which of the following statements best describes the results of the test.
A We fail to reject the null hypothesis of linearity at the level.
B There is not sufficient evidence to determine whether a linear functional form is correct.
C The null hypothesis of linearity is rejected at the level.
D The null hypothesis of nonlinearity is rejected at the level.
Estimate a loglog version of the above regression ie all variables are transformed by the natural logarithm A increase in the budget is associated with increase in revenue.
Hint: You will need to convert voteaverage to voteaverage" before you can take the natural logarithm.
a a
b a $
c an approximately
d a $
Ramsey's RESET test on the loglog regression model is Select significant or not significant at the or suggesting that Select higher order terms should be added or no further transformations are necessary The residual plot for the loglog regression is Select less or more balanced around or but the variance indicates Select heteroskedasticity or homoskedasticity
Is multicollinearity an issue with the above regression?
A Yes, budget and revenue have a correlation of
B No since multicollinearity only biases the regression coefficients not the standard errors.
C No the absolute value of the cross correlation between all of the independent variables is less than
D Yes, the variance inflation factor of 'budget' is more than
The loglog regression above Select does not exhibit or exhibits heteroskedasticity based on the Select Cooks Distance or ShapiroWilk or BreuschPagan or Ramsay RESET test. The null hypothesis was Select rejected or not rejected at the level.
What is the standard error on the log of budget when the errors are corrected for heteroskedasticity?
round to decimal places
For which regression do we reject the null hypothesis of normallydistributed errors at the significance level?
check all that apply
Ast standard, linear regression
Bnd loglog regression
Which of the following movies is considered an influential point in the standard, linear regression ie not the loglog regression
Estimate the following linear regression.
revenue budget verage year
Upload a picture of the residual plot.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
