Question: Here is my code in R # Install necessary packages install.packages ( ' alr 4 ' ) install.packages ( ' doParallel ' ) install.packages (

Here is my code in R
# Install necessary packages
install.packages('alr4')
install.packages('doParallel')
install.packages('caret')
# Load the required libraries
library(alr4)
library(caret)
library(doParallel)
library(foreach)
# Load the MinnLand dataset
data(MinnLand)
# Check column names to confirm correct variable names
colnames(MinnLand)
# Remove rows with NA values
Minnland <- na.omit(MinnLand)
# Convert year to factor
MinnLand$year <- as.factor(MinnLand$year)
# Define formulas for log and sqrt models
formula_log <- log(acrePrice) ~ .
formula_sqrt <- sqrt(acrePrice) ~ .
# Set up parallel processing
cl <- makeCluster(detectCores()-1)
registerDoParallel(cl)
# 5-Fold Cross Validation
train_control_5fold <- trainControl(method ='cv', number =5)
# 10-Fold Cross Validation
train_control_10fold <- trainControl(method ='cv', number =10)
# Random Splitting with 3 splits
train_control_random <- trainControl(method = 'repeatedcv', number =5, repeats =3)
# Fit log models
model_log_5fold <- train(formula_log, data = MinnLand, method =
"lm", trControl = train_control_5fold)
model_log_10fold <- train(formula_log, data = MinnLand, method =
"lm", trControl = train_control_10fold)
model_log_random <- train(formula_log, data = MinnLand, method =
"lm", trControl = train_control_random)
# Fit sqrt models
model_sqrt_5fold <- train(formula_sqrt, data = MinnLand, method ='lm', trControl = train_control_5fold)
model_sqrt_10fold <- train(formula_sqrt, data = MinnLand, method ='lm', trControl = train_control_10fold)
model_sqrt_random <- train(formula_sqrt, data = MinnLand, method ='lm', trControl = train_control_random)
# Compare models
print(paste("Log Model 5-fold RMSE:",
model_log_5fold$results$RMSE))
print(paste("Log Model 10-Fold RMSE:",
model_log_10fold$results$RMSE))
print(paste("Log Model Random RMSE:",
model_log_random$results$RMSE))
print(paste("Sqrt Model 5-fold RMSE:",
model_sqrt_5fold$resultsRMSE))
print(paste("Sqrt Model 10-Fold RMSE:",
model_sqrt_10fold$results$RMSE))
print(paste("Sqrt Model Random RMSE:",
model_sqrt_random$results$RMSE))
# Stop parallel processing
stopCluster(cl)
I am getting the following error:
> # Fit log models
> model_log_5fold <- train(formula_log, data = MinnLand, method =
+"lm", trControl = train_control_5....[TRUNCATED]
Error in na.fail.default(list(`log(acrePrice)`= c(6.64118216974059,6.59714570188665, :
missing values in object
What is causing this and how do I fix it?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!