Question: ( 2 2 points ) In this next question, we'll use KNN to try to classify players' preferred foot. ( a ) ( 1 point

(22

points

)

In this next question, we'll use KNN to try to classify players' "preferred foot."

(

) (1

point

)

First, let's get a better sense of the balance of classes in our data

(

,

how

many observations of each class we have

) .

Display the count of each value present in the

preferred foot column.

(

) (2

points

)

If we were to build a classifier which always guessed a player preferred their

right foot, what percentage of the time would we make a correct classification? In other

words, what percent of players actually do prefer their right foot?

(

) (1

point

)

Let's build a classifier using

10

available dimensions: shooting, passing, drib

-

bling, defending, attacking, skill, movement, power, mentality, and goalkeeping. Create

x

dataframe with just these

10

columns and display the first

5

rows.

(

) (2

points

)

Now, rescale

(

or normalize

)

this

x

data so that each IV has a mean of

0

and

a standard deviation of

1 .

Display

(

at least

)

the first three rows of normalized data.

(

) (2

points

)

We'll want to be able to see how well our classifier performs out of sample,

so now create a validation

-

train split, setting

Y

to be the "preferred foot" column of the

dataframe. Here, use

30 %

of the data for validation and set the random state to

456 .

Display

(

at least

)

the first

3

rows of

x

training data.

(

) (4

points

)

Next, we'll want to determine the number of neighbors

k

to consider for our

KNN classifier. For values of

k

from

1 - 30 (

inclusive

),

calculate either the error or the

accuracy of a KNN classifier. Display your results by creating a plot with considered

k

values along the horizontal axis and the corresponding error

(

or accuracy

)

displayed

along the vertical axis.

(

) (4

points

)

Based on your analysis, choose a reasonable value of k

.

Fit a KNN classifier

that considers this number of neighbors and predict

Y

values

(

preferred foot

)

for your

out of sample validation data. Display

(

at least

)

the first

3

predictions for "preferred

foot."

(

) (2

points

)

Use actual and predicted Y values to calculate and display the confusion

matrix for your model. This will display without labels, but will show the classes in

alphabetical order

(

Left

,

Right; upper left corner is "Left

-

Left"

) .

As with the examples

in lecture, the rows will indicate the actual values and the columns will indicate the

predicted values. Approximately how many players who actually prefer their left foot

("

True Lefts"

)

were predicted to prefer their right foot?

(

) (2

points

)

Use the actual and predicted Y values to display the full classification report.

What does the recall for the classification "Left" suggest about our model?

(

) (2

points

)

Reflecting on the analysis above, do you feel like this model does a good job

or a bad job of predicting a player's preferred foot? Briefly explain your answer.

( 2 2 points ) In this next question, we'll use

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!

3 COLLEGE ALGEBRA - TRIGONOMETRY Business and Finance (MAT115) This course will start with a review of basic algebra (factoring, solving linear equations, and equalities, etc.) and proceed to a study...

I did this assignment to complete a KNN classifier, but I'm having trouble to identify what's wrong with my code, specifically when trying to use the model with 3 neighbors. This is the title of the...

I have literally posted the complete assignment information, can I please have some help with these two problems? MAT 375 Module Two Guided Activity: (Continuous) Dynamical Systems Our textbook has a...

\fHow to Keep A PLAYERS Productive Just because they think they're great doesn't mean they're not. YEL MAG CYAN BLACK by Steven Berglas A JOHN RITTER fter graduating from Harvard Business School with...

I need help on the Boulder Public School Case Study . There are three questions asked at the end of the case study. HBSP Product Number TCG239. Rev. Oct. 2015 THE CRIMSON PRESS CURRICULUM CENTER THE...

will require integrating human resource management theory and thought (as described in the textbook) while addressing major issues in the field and therefore will require proper referencing and...

Use Python to complete the following code. 3. Classification and K Nearest Neighbor Download and read the document that explains how to apply the K Nearest Neighbor (KNN) algorithm to classify data....

(mba 5900) (please type the answer to question number 3 only question number 3 needs to be answered)( please use a plagarize checker) only question 3 needs to be answered Case questions: Under...

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

In the opening of the case, Randall Parman of Applebees International compared data to gold. Although it is easy to figure out the value of gold at any time, valuing data has always been subject to...

Consider an application that transmits data at a steady rate (for example, the sender generates an N-bit unit of data every k time units, where k is small and fixed). Also, when such an application...

9. Show the expanded code produced by the following statement that invokes the mWrite- String macro from Section 10.2.5: mWriteStr namePrompt

Pharoah Company reported the following amounts for 2022: Raw materials purchased $95,200 Beginning raw materials inventory 5,824 Ending raw materials inventory 5,040 Beginning finished goods...