Question: Using RStudio please provide step by step with code Please use the data set below to answer the questions and refer the data set as

Using RStudio please provide step by step with code
Please use the data set below to answer the questions and refer the data set as movies
The data set may not match the answers to some of the problems, it is the R code to get the answer that I need help with
Data Set:
title original_language release_date popularity vote_count vote_average budget revenue runtime collection year month comedy scifi fantasy action sequel number
Smile en 9/23/20221863.6281146.81.70E+074.50E+07115020229000001
The Black Phone en 6/22/20221071.39827367.9188000001.61E+08103020226000001
Jeepers Creepers: Reborn en 9/15/2022821.6051255.82.00E+072892594889489920229000014
Nope en 7/20/2022733.112168476.80E+07170800000130020227010001
X en 3/17/2022543.6710356.81.00E+071425760910695028920223000001
Dahmer en 6/21/2002378.392205.2250000144008101020026000001
Halloween Kills en 10/14/2021315.02719416.72.00E+07131647155105913612021100000110
Scream en 1/12/2022291.19617326.72.40E+071.40E+08114260220221000015
Orphan en 7/24/2009265.858440372.00E+077791225112376019320097000001
Jeepers Creepers 3 en 9/26/2017246.1896924.9620000036000001009489920179000013
1) Which of the following is an assumption of the regression model?
(check all that apply)
A) At least one independent variable is endogenous.
B) Regression errors are independently and identically distributed (same mean & variance).
C) Regression errors are normally distributed around the population mean of the dependent variable.
D) Independent variables are collinear with each other.
2) What does the residual plot tell you about the regression? Do you spot any abnormalities?
3) Estimate Ramsay's RESET test on the regression. Which of the following statements best describes the results of the test.
A) We fail to reject the null hypothesis of linearity at the 0.05 level.
B) There is not sufficient evidence to determine whether a linear functional form is correct.
C) The null hypothesis of linearity is rejected at the 0.05 level.
D) The null hypothesis of nonlinearity is rejected at the 0.05 level.
4) Estimate a log-log version of the above regression (i.e. all variables are transformed by the natural logarithm). A 1% increase in the budget is associated with _________ increase in revenue.
(Hint: You will need to convert vote_average to "1+ vote_average" before you can take the natural logarithm.)
a) a 92%
b) a $0.92
c) an approximately 0.92%
d) a $92
5) Ramsey's RESET test on the log-log regression model is [ Select ][significant or not significant] at the 0.05 or suggesting that [ Select ][higher order terms should be added or no further transformations are necessary]. The residual plot for the log-log regression is [ Select ][less or more] balanced around 0 or but the variance indicates [ Select ][heteroskedasticity or homoskedasticity].
6) Is multicollinearity an issue with the above regression?
A) Yes, budget and revenue have a correlation of 0.56.
B) No since multicollinearity only biases the regression coefficients not the standard errors.
C) No, the absolute value of the cross correlation between all of the independent variables is less than 0.18.
D) Yes, the variance inflation factor of 'budget' is more than 10.
7) The log-log regression above [ Select ][does not exhibit or exhibits] heteroskedasticity based on the [ Select ][Cook's Distance or Shapiro-Wilk or Breusch-Pagan or Ramsay RESET] test. The null hypothesis was [ Select ][rejected or not rejected] at the 0.001 level.
8) What is the standard error on the log of budget when the errors are corrected for heteroskedasticity?
(round to 3 decimal places)
9) For which regression do we reject the null hypothesis of normally-distributed errors at the 0.05 significance level?
(check all that apply)
A)(1st) standard, linear regression
B)(2nd) log-log regression
10) Which of the following movies is considered an influential point in the standard, linear regression (i.e. not the log-log regression)?
11) Estimate the following linear regression.
revenue =0+1 budget +2voteaverage +3 year +loni
Upload a picture of the residual plot.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!