X train, X test, y train, y test train test split , test size 0 2 , random state 4 2 Define the column transformer for preprocessing numeric features ' YrSold ' , 'SaleType' categorical features ' GarageCars ' , ' PoolArea' preprocessor ColumnTransformer ( transformers ( ' num ' , MinMaxScaler ( ) , numeric features ) , ( ' cat ' , OneHotEncoder ( ) , categorical features ) Create Random Forest pipeline rf pipeline Pipeline ( steps ( ' preprocessor ' , preprocessor ) , ( ' regressor ' , RandomForestRegressor ( ) ) ) Create K Nearest Neighbors pipeline knn pipeline Pipeline ( steps ( ' preprocessor ' , preprocessor ) , ( ' regressor ' , KNeighborsRegressor ( ) ) ) Define hyperparameters grid for GridSearchcv rf param grid 'regressor n estimators ' 5 0 , 1 0 0 , 'regressor max depth' 2 , 3 , 4 , 'regressor criterion' ' mse ' , 'mae' knn param grid 'regressor n neighbors' 2 , 5 , 1 0 , 2 0 , 5 0 , 1 0 0 , 'regressor weights' ' uniform ' , 'distance' Perform GridSearchCV with 5 fold cross validation for Random Forest rf grid search GridSearchCV ( rfR Calculate the following evaluation metrics of the 2 models' performance on the training data set Root mean squared error MSE Mean absolute error MAE Mean absolute percentage error MAPE Write a paragraph describing the results and which model and set of hyper parameters worked the best and based on which accuracy metric ( s ) If you were to explore more hyper parameters for each model, how would you expand or limit the current hyperparameter grid as 1 9 df pd read csv ( ' train csv ' ) 2 0 df columns Index ( ' Id ' , 'MSSubClass', 'MSZoning', 'LotFrontage', 'LotArea', 'Street', 'Alley', 'LotShape', 'LandContour', 'Utilities', 'LotConfig', 'LandSlope', 'Neighborhood', 'Condition 1 ' , 'Condition 2 ' , 'BldgType', 'HouseStyle', 'Overallqual', 'Overalicond', 'YearBuilt', 'YearRemodAdd', 'RoofStyle', 'RoofMatl', 'Exterior 1 st ' , 'Exterior 2 nd ' , 'MasVnrType', 'MasVnrArea', 'ExterQual' 'ExterCond', 'Foundation', 'BsmtQual', 'BsmtCond', 'BsmtExposure', 'BsmtFinType 1 ' , 'BsmtFinSF 1 ' , pipeline, rf paran grid, cv 5 )

The Answer is in the image, click to view ...

Question: X _ train, X _ test, y _ train, y _ test = train _ test _ split , test _ size = 0 .

_

train, X

_

test, y

_

train, y

_

test

=

train

_

test

_

split

,

test

_

size

= 0.2,

random

_

state

= 42

# Define the column transformer for preprocessing

numeric

_

features

= ['

YrSold

',

'SaleType'

]

categorical

_

features

= ['

GarageCars

','

PoolArea'

]

preprocessor

=

ColumnTransformer

(

transformers

('

num

',

MinMaxScaler

(),

numeric

_

features

),

('

cat

',

OneHotEncoder

(),

categorical

_

features

)

# Create Random Forest pipeline

_

pipeline

=

Pipeline

(

steps

= [('

preprocessor

',

preprocessor

),

('

regressor

',

RandomForestRegressor

())])

# Create K

-

Nearest Neighbors pipeline

knn

_

pipeline

=

Pipeline

(

steps

= [('

preprocessor

',

preprocessor

),

('

regressor

',

KNeighborsRegressor

())])

# Define hyperparameters grid for GridSearchcv

_

param

_

grid

'regressor

_

_

estimators

'

[50, 100],

'regressor

_

max

_

depth':

[2, 3, 4],

}

'regressor

_

criterion':

['

mse

',

'mae'

]

knn

_

param

_

grid

'regressor

_

_

neighbors':

[2, 5, 10, 20, 50, 100],

}

'regressor

_

weights':

['

uniform

',

'distance'

]

# Perform GridSearchCV with

5 -

fold cross validation for Random Forest

_

grid

_

=

GridSearchCV

(

rfR

Calculate the following evaluation metrics of the

2

models' performance on the training data set:

Root mean squared error MSE

Mean absolute error MAE

Mean absolute percentage error MAPE

Write a paragraph describing the results and which model and set of hyper parameters worked the best and based on which accuracy metric

(

) ?

If you were to explore more hyper parameters for each model, how would you expand or limit the current hyperparameter grid.

[19]

=

.

read

_

csv

('

train

.

csv

')

[20]

.

columns

Index

('

',

'MSSubClass', 'MSZoning', 'LotFrontage', 'LotArea', 'Street',

'Alley', 'LotShape', 'LandContour', 'Utilities', 'LotConfig',

'LandSlope', 'Neighborhood', 'Condition

1',

'Condition

2',

'BldgType',

'HouseStyle', 'Overallqual', 'Overalicond', 'YearBuilt', 'YearRemodAdd',

'RoofStyle', 'RoofMatl', 'Exterior

1

',

'Exterior

2

',

'MasVnrType',

'MasVnrArea', 'ExterQual': 'ExterCond', 'Foundation', 'BsmtQual',

'BsmtCond', 'BsmtExposure', 'BsmtFinType

1',

'BsmtFinSF

1',_

pipeline, rf

_

paran

_

grid, cv

= 5)

X_train, X_test, y_train, y_test = train_test_split , test_size =0.2, random_state =42

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

P, FP, TN, and FN respectively? Given a confusion matrix below, please (1) show which values are T (2) Calculate the accuracy according to the given equation. 4 Total Predicted Predicted buy...

can you please edit this code to fit the test in python: import argparse import logging import sys import os import time import numpy as np import gym from gym import wrappers, logger class...

1) X is a random variable having pdf T(20) fx(1 0) =(@) r(e) 0-1 (1- 2) [(0,1) (2), where 0 c 0 = {1, 2}. (a) Using X as a test statistic, give the rejection region for the MP size 0.1 test of Ho : 0...

import numpy as np #machine learning tool used for efficient array processing import pandas as pd #machine learning tool used for data sets and data frames from sklearn.model _ selection import train...

P1 Make use of the scikit-learn (sklearn) python package in your function implementations Complete train_test_split function Using the train_test_split function from sklearn implement a function that...

Assignment 3: Nave Bayes Classifier for Spam Email Prediction Procedure 1) Follows steps in the given Jupyter Notebook file, named Spam Classification Using Naive Bayes.ipynb, to go through text data...

Jupyter Notebook Now that we have tried our hand at some single-layer nets, let's see how they stack up compared to multi-layer nets. :) We will be exploring the basic concepts of learning non-linear...

Make use of the scikit-learn (sklearn) python package in your function implementations Complete train_test_split function Using the train_test_split function from sklearn implement a function that...

Please help me solve file DAC's Classification and Evaluation according to the pictured instructions. #Step 1 : # Import libraries # In this section, you can use a search engine to look for the...

Files to Edit and Submit: You will need to edit and submit ( DACs _ classification.py , evaluation.py ) files to implement your model for the pulsar dataset. You can copy and paste all the necessary...

Why do the stated (contract) rate and the effective rate (yield) of interest on bonds frequently differ?

7. (a) Show that the de Broglie wavelength of a particle, of charge e, rest mass mo, moving at relativistic speeds is given as a function of the accelerating potential V as h -1/2 V2moeV eV 1 +...

Professional judgment is influenced by: Organizationat vatues Personal code of ethics Personal behavioral traits Organizational dissonance

If today is February 28, 2026, what is the fair value for an S&P 500 index futures contract expiring on June 20, 2026? Information: S&P 500 Cash Index: 1,430 Interest Rate: 0.19% Days Until...

Team learning. This begins with the capacity of members of a team to suspend judgement and start to think together and to recognize the patterns of interaction within a team that militate against...

makes human resource development strategy central to business policy in order that the process of individual and organizational learning becomes a major business activity;

provides a continuous process of organizational transformation designed to harness the products of individual learning in order to make fundamental changes in assumptions, goals, norms and operating...