Question: Import the data set using pd . read _ excel and report the shape ( rows , columns ) of data. Remove the variables INSTNM

Import the data set using pd

.

read

_

excel and report the shape

(

rows

,

columns

)

of data.

Remove the variables INSTNM and Appl from the dataset. Split the data set into a training set

(60 %)

and a validation set

(40 %)

and use random

_

state

= 5555 .

Keep the variable UNITID but it is just an ID so should NOT be used in any analysis or modeling.

Provide the descriptive statistics of the variable Ug

_

enter, the total number of entering students at undergraduate level.

If we are to predict the variable Ug

_

enter using other variables with linear regression, provide at least two appropriate EDAs prior to modeling.

Drop missing values using

.

dropna

()

function. Report the size of the remaining training and validation data sets.

Fit a linear regression model on the training set to predict Ug

_

enter. Display the model's coefficients and the Mean Squared Error

(

MSE

)

of the validation set.

Perform

5 -

fold CV and show the CV error, i

.

.,

the avergae MSE.

Perform LOOCV and show the CV error.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Assignment 3: Nave Bayes Classifier for Spam Email Prediction Procedure 1) Follows steps in the given Jupyter Notebook file, named Spam Classification Using Naive Bayes.ipynb, to go through text data...

What to submit: For this section of the project, you will be submitting a .Rmd file (LastName_FirstName.Rmd) that will contain all the code and written answers for the steps mentioned below. a...

Why this comment still shows? One or more test cases in this cell did not pass.Instructor hints: 1. "For Problem 2, Part A, look at the shape of X_train."2. "For Problem 2, Part A, look at...

#################################################### # 5 . 3 Assessing Fit of Line # # Run in Groups and discuss # #################################################### install.packages ( " ggplot 2 "...

9 7 9 12 8 10 9 10 10 11 11 12 10 11 10 8 9 10 9 13 12 13 9 11 9 14 8 8 11 12 10 11 11 11 10 7 10 9 14 9 11 10 10 10 11 8 13 10 7 9 11 11 12 11 10 7 7 10 10 10 11 10 9 10 10 13 10 11 10 11 11 9 8 9 9...

I am completing an assignment for my development of supportive systems class. This assignment is working with an excel document and support files. I am using a mac & need help on completing the...

Edit question Here are the draft of my Group Project, title is :The impacts of a well-balanced diet on immunity in combating the COVID-19 virus in various countries How many countries adhere to the...

Here are the draft of my Group Project, title is :The impacts of a well-balanced diet on immunity in combating the COVID-19 virus in various countries we are solvong the 3 questions: 1.How many...

PLEASE HELP ME WITH THIS WHOLE PYTHON PROGRAMMING PROJECT Activity 1: Create Dummy Dataset In this activity, you have to execute the code cell which creates a dummy dataset for multiclass...

ALY6010 Module 1 Project Instructor: Dr. Dee Chiluiza, PhD Discrete probability and normal distributions Overview and Rationale This assignment is designed to provide hands-on experience in...

Investors expected Company X to announce a 10% increase in earnings, However, at the end of the year Company announced a 1% increase If the market is semi-strong from efficient. What would most...

In the locked position shown, the toggle clamp exerts at A vertical 1.2-kN force on the wooden block, and handle CF rests against the stop at G. Determine the force P required to release the clamp....

According the case Meaningful work and unethical work: The crisis in Australian financial advice, Reflect on what you would have done in a similar situation and why?

Discuss how governments can influence exchange rates?