Question: 1 . R programming: The first task is to write the code to implement the K - Nearest Neighbors, or KNN , model from scratch.

1 .

R programming: The first task is to write the code to implement the K

-

Nearest Neighbors, or KNN

,

model from

scratch. We will do this in steps:

Write a function called euclidean

_

distance that calculates the Euclidean distance between two vectors.

There are two input arguments for this function: vector

1 (

vec

1),

and vector

2 (

vec

2) .

The output for

this function is a numeric, the Euclidean distance

(

euclDist

) .

Write a function called manhattan

_

distance that calculates the Manhattan distance between two

vectors. There are two input arguments for this function: vector

1 (

vec

1),

and vector

2 (

vec

2) .

The

output for this function is a numeric, the Manhattan distance

(

manhDist

) .

Write a function called euclidean

_

distance

_

all that calculates the Euclidean distance between a

vector and all the row vectors in an input data matrix. There are two input arguments for this

function: a vector

(

vec

1)

and an input data matrix

(

mat

1_

) .

The output for this function is a vector

(

output

_

euclDistVec

)

which is of the same length as the number of rows in mat

1_

.

This function

must use the function euclidean

_

distance you previously wrote.

Write a function called manhattan

_

distance

_

all that calculates the Manhattan distance between

a vector and all the row vectors in an input data matrix. There are two input arguments for this

function: a vector

(

vec

1)

and an input data matrix

(

mat

1_

) .

The output for this function is a vector

(

output

_

manhattanDistVec

)

which is of the same length as the number of rows in mat

1_

.

This function

must use the function manhattan

_

distance you previously wrote.

Write a function called my

_

KNN that compares a vector to a matrix and finds its K

-

nearest neighbors.

There are five input arguments for this function: vector

1 (

vec

1),

the input data matrix

(

mat

1_

),

the

class labels corresponding to each row of the matrix

(

mat

1_

),

the number of nearest neighbors you are

interested in finding

(

),

and a Boolean argument specifying if we are using the Euclidean distance

(

euclDistUsed

) .

The argument K should be a positive integer. If the argument euclDistUsed

=

TRUE,

then use the Euclidean distance. Otherwise, use the Manhattan distance. The output of this function

is a list of length

2 (

output

_

knnMajorityVote

) .

The first element in the output list should be a vector

of length K containing the class labels of the closest neighbors. The second element in the output list

should be the majority vote of the K class labels in the first element of the list. The function must use

the functions euclidean

_

distance and manhattan

_

distance you previously wrote.

Apply this function to predict the label of the

123

rd observation using the first

100

observations as your input

training data matrix. Use K

= 10 .

What is the predicted label when you use Euclidean distance? What is

the predicted label when you use Manhattan distance? Are these predictions correct

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

USE PYTHON3; DO NOT IMPORT ANY PACKAGES Please do all of part 3 and make sure to follow all the requirements; do not add or remove any parameters because it will make the answer incorrect even if the...

Please use python to write this program. Part A. k Nearest Neighbor (kNN) Supervised Learner (40 points) Write a program that performs supervised classification using the kN N algorithm which assigns...

Note : Every thing explained clearly in the question plz answer correctly WRITE THE PROGRAM IN "JAVA " | Programming Assignment 2: Group project (2 students) In many hazardous operations where humans...

Lab 6: Random Expressions This lab deals with Expression Trees. For background make sure you are familiar with material in section 9.4.3 of the textbook. The basic idea here is that we imagine that...

I did this assignment to complete a KNN classifier, but I'm having trouble to identify what's wrong with my code, specifically when trying to use the model with 3 neighbors. This is the title of the...

Figure gives the energy levels for an electron trapped in a finite potential energy well 450eV deep. If the electron is in the n = 3 state, what is its kineticenergy? -Nonquantized - Top of well 450...

The following information is available for Lock-Tite Company, which produces special-order security products and uses a job order costing system. April 30 May 31 Inventories Raw materials $40,000...

of Audicing 3 8 of 3 7 Who is responsible for establishing auditing standards for privately held compantes? A . Public Company Accounting Oversight Board B . Securities and Exchange Commission C ....

A rich relative has bequeathed you a growing perpetuity. The first payment will occur in a year and will be $3,000. Each year after that, you will receive a payment on the anniversary of the last...

What is the default Aggregation Method in SQL Server Analysis Services in Cube Processing? What are the other options?

What is the default Aggregation Method in SQL Server Analysis Services in Cube Processing? What are the other standard optional methods?

Before starting an SQL Server Analysis Services Multidimensional Modeling Project, why is identification of a Data Source important?