Question: NEED HELP IN (R) # Call the ISLR library and check the head of College (a built-in data frame # with ISLR, use data() to

NEED HELP IN (R)

# Call the ISLR library and check the head of College (a built-in data frame # with ISLR, use data() to check this.) Then reassign College to a dataframe # called df code here

# EDA # Let's explore the data! # Create a scatterplot of Grad.Rate versus Room.Board, colored by the # Private column.

code here

# Create a histogram of full time undergrad students, color by Private. code here

# Create a histogram of Grad.Rate colored by Private. You should see something odd here. code here

# What college had a Graduation Rate of above 100% ? code here

# Change that college's grad rate to 100% code here

# Train Test Split # Split your data into training and testing sets 70/30. Use the caTools # library to do this.

code here

# Decision Tree # Use the rpart library to build a decision tree to predict whether or not a # school is Private. Remember to only build your tree off the training data.

code here

# Use predict() to predict the Private label on the test data. code here

# Check the Head of the predicted values. You should notice that you actually have two columns with the probabilities. code here

# Turn these two columns into one column to match the original Yes/No Label # for a Private column. code here

# Lots of ways to do this joiner <- function(x){ if (x>=0.5){ return('Yes') }else{ return("No") } } tree.preds$Private <- sapply(tree.preds$Yes,joiner) head(tree.preds)

# Now use table() to create a confusion matrix of your tree model. code here

# Use the rpart.plot library and the prp() function to plot out your tree # model.

code here

# Random Forest # Now let's build out a random forest model! # Call the randomForest package library library(randomForest)

# Now use randomForest() to build out a model to predict Private class. # Add importance=TRUE as a parameter in the model. (Use help(randomForest) # to find out what this does. code here

# What was your model's confusion matrix on its own training set? # Use model$confusion. code here

# Grab the feature importance with model$importance. Refer to the reading # for more info on what Gini[1] means.[2] code here

# Predictions # Now use your random forest model to predict on your test set! code here

# It should have performed better than just a single tree, how much better # depends on whether you are emasuring recall, precision, or accuracy as # the most important measure of the model.

#Ref: www.pieriandata.com

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

R Language Help # Call the ISLR library and check the head of College (a built-in data frame # with ISLR, use data() to check this.) Then reassign College to a dataframe # called df code here # EDA #...

R project help # KNN Project # Since KNN is such a simple algorithm, we will just use this "Project" as a # simple exercise to test your understanding of the implementation of KNN. # By now you...

NEED HELP FOR CODDING IN (R) # KNN Project # Since KNN is such a simple algorithm, we will just use this "Project" as a # simple exercise to test your understanding of the implementation of KNN. # By...

Calculate Pearson's correlation coefficient () between the variables Weight and Head in the babyanth.complete data frame using the following formula. Include the correlation value you calculated in a...

Topic: Machine Learning Run cach R program and save the output using Snipping Tool and then paste it into a Word document. Zip up all files (document and R programs) into a single zipped file and...

Confirming Pages C H A P T E R 19 Analyzing Information and Writing Reports Chapter Outline Using Your Time Efficiently Analyzing Data and Information for Reports Identifying the Source of the Data...

********************node1.h************** ******************node1.cpp*************** #include "Node.h" #include // Provides assert #include // Provides NULL and size_t using namespace std; namespace...

Work is to be done in R The Credit dataset comprises simulated information on 4 0 0 customers. Our objective is to develop a prediction model capable of forecasting their credit balances to aid NMU...

Part 1 College data set is available in ISLR Library. Load the College data in the R environment by loading the ISLR library. Description of College data set available at ISLR Library: Statistics for...

VIEW THE STEP-BY-STEP SOLUTION TO: Title: ABC Appliance Inherent Risk and Control Design Assessment Prepare Inherent Risks and Control Design Assessment. This course project/case... I.Title: ABC...

If you earned $80,000 this year, you would pay more CPP than your brother, who earned $60,000. Agree or disagree, and explain your answer.

You are an independent financial planner in Sydney, Australia. It is currently April 2020 and you have just concluded a meeting with your client below. Client: Muhammad. Marital Status: Single,...

If an investor wants to build a bond portfolio that maintains a stable value, she would purchase bonds with

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

Define and measure service productivity.

Understand return on quality and determine the optimal level of reliability.

Understand how to integrate all the tools to improve the quality and productivity of customer service processes.