Question: To perform k - NN classification and answer the questions, follow these steps in RStudio: # Load required libraries library ( caret ) library (

To perform k

-

NN classification and answer the questions, follow these steps in RStudio:

# Load required libraries

library

(

caret

)

library

(

mlbench

)

# Load the UniversalBank dataset

data

("

UniversalBank

")

# Split the data into training and holdout sets

(60 %

training,

40 %

holdout

)

set.seed

(123)

index

< -

createDataPartition

(

UniversalBank$PersonalLoan, p

= 0.6,

list

=

FALSE

)

train

_

data

< -

UniversalBank

[

index

,]

holdout

_

data

< -

UniversalBank

[-

index,

]

# Define categorical predictors as factors

categorical

_

cols

< -

("

Family

",

"Education", "SecuritiesAccount", "CDAccount", "Online", "CreditCard"

)

train

_

data

[

categorical

_

cols

] < -

lapply

(

train

_

data

[

categorical

_

cols

],

.

factor

)

holdout

_

data

[

categorical

_

cols

] < -

lapply

(

holdout

_

data

[

categorical

_

cols

],

.

factor

)

# Define the new customer's data

new

_

customer

< -

data.frame

(

Age

= 40,

Experience

= 10,

Income

= 84,

Family

= 2,

CCAvg

= 2,

Education

= 2,

Mortgage

= 0,

SecuritiesAccount

= 0,

CDAccount

= 0,

Online

= 1,

CreditCard

= 1

)

# Perform k

-

NN classification with k

= 1

knn

_

model

< -

train

(

PersonalLoan ~

.,

data

=

train

_

data,

method

= "

knn

",

preProcess

=

("

center

",

"scale"

),

tuneGrid

=

expand.grid

(

= 1),

trControl

=

trainControl

(

method

= "

",

number

= 5)

)

# Classify the new customer using the best k

predicted

_

class

< -

predict

(

knn

_

model, new

_

customer

)

predicted

_

prob

< -

predict

(

knn

_

model, new

_

customer, type

=

"prob"

)

# Print the predicted class and probability

(

predicted

_

class

)

(

predicted

_

prob

)

# Confusion matrix for holdout data using the best k

best

_

< -

knn

_

model$bestTune$k

predicted

_

holdout

< -

predict

(

knn

_

model, holdout

_

data

)

confusion

_

matrix

< -

confusionMatrix

(

predicted

_

holdout, holdout

_

data$PersonalLoan

)

(

confusion

_

matrix

)

Explanation:

The task involves using k

-

NN classification to predict whether customers will accept a personal loan offer based on demographic and banking information. Here are the steps:

Load the UniversalBank dataset and split it into training

(60 %)

and holdout

(40 %)

sets.

Define categorical predictors as factors for k

-

.

Define a new customer's data.

Perform k

-

NN classification with k

= 1

on the training data.

Classify the new customer using the best k obtained from

5 -

fold cross

-

validation.

Display the predicted class and probability for the new customer.

Calculate the confusion matrix for the holdout data using the best k

.

This analysis helps determine if the new customer is likely to accept a personal loan offer based on their attributes and previous campaign

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

1 Assignment 2 Latent Variables and Neural Networks Due Date: 21:59:59 23 May 2021 Please note that, 1. 1 sec delay will be penalized as 1 day delay. So please submit your assignment in advance...

1 WILMINGTON UNIVERSITY LIBRARY Business Continuity Plan (BCP) and Disaster Recovery Plan (DRP) 2 WILMU LIBRARY SERVICES 20,000 student population served 13 physical locations of access to university...

this is the questionplease help me thanksits a project about renovation one level of a building. project management case . This is a renovation project On the basis of group work as a personal...

Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...

Please follow the question and answer it, I need my code in python in rstudio You must only analyse the specified data. No You should explain your decisions with other data is to be used for this...

For the exclusive use of S. Setiawan, 2015. 9-910-036 REV: APRIL 11, 2011 BENJAMIN EDELMAN THOMAS R. EISENMANN Go oogle In nc. Go oogle's mission is to organize the world's inf n nformation and make...

For the exclusive use of F. Ortolano, 2015. 9-910-036 REV: APRIL 11, 2011 BENJAMIN EDELMAN THOMAS R. EISENMANN Go oogle In nc. Go oogle's mission is to organize the world's inf n nformation and make...

Please use python to write this program. Part A. k Nearest Neighbor (kNN) Supervised Learner (40 points) Write a program that performs supervised classification using the kN N algorithm which assigns...

Universal Bank is relatively young bank growing rapidly in terms of overall customer acquisition. The majority of these customers are liability customers depositors) with varying sizes of...

Hello , i badly need your help maam/sir :( I promise to rate as helpful and will provide a feedback. Please ?? Application ? 1. Choose the best sampling method to obtain the individuals in the sample...

Record the following transactions for Cyrus Company. (Credit account titles are automatically indented when the amount is entered. Do not indent manually.) 1. On August 4, Cyrus sold merchandise on...

Determine the radius of gyration about the centroidal x-axis of the figure shown. Round off to the nearest two (2) decimal places. Express your answer in mm. 12 mm 8 mm- y 24 mm- O 12 mm 24 mm: 6 mm...

q , 1 7 . Which one of the following statements is correct? a . The greater the volatility of returns, the greater the risk premium. b . The lower the volatility of returns, the greater the risk...

The Pullman? Mfg., Inc.? three-station work cell illustrated in the figure below has two machines at station 1 in parallel.? (The product needs to go through only one of the two machines before