Question:

In this assignment, you will work with an Activity Recognition dataset to explore, preprocess, and apply various classification algorithms. You will build a Bayesian Classifier from scratch and implement Kernel Discriminant Analysis (KDA) using different kernels. Additionally, you will apply multiple classifiers using the scikit-learn library on both the entire dataset and the PCA-reduced dataset, comparing their performance.
Part 1: Data Preprocessing
Loading the Dataset:
Load the Activity Recognition dataset into a pandas DataFrame. Display the first few rows and check for any missing values. Handle missing values if necessary.
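A minimal loading sketch follows; the file name activity_recognition.csv is an assumption, so point it at wherever your copy of the dataset lives.

```python
import pandas as pd

# Hypothetical file name -- replace with the actual path to your dataset.
df = pd.read_csv("activity_recognition.csv")

print(df.head())           # first few rows
print(df.isnull().sum())   # missing values per column

# One simple strategy if missing values exist: drop the affected rows.
df = df.dropna()
```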
Train-Test Split:
Split the dataset into training and testing sets. Use an 80/20 split, where 80% of the data is used for training and 20% for testing.
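One way to do the 80/20 split, assuming the class label is stored in a column named Activity (adjust the name to match your file); stratifying is an extra choice that keeps the class proportions similar in both sets.

```python
from sklearn.model_selection import train_test_split

X = df.drop(columns=["Activity"])   # "Activity" is an assumed label column name
y = df["Activity"]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=42, stratify=y
)
```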
Part 2: Classification on the Entire Dataset
In this part, you will apply several classifiers to the entire dataset without any dimensionality reduction.
Bayesian Classifier (from scratch):
Implement a Bayesian classifier by calculating the prior probabilities and modeling the likelihood with a Gaussian distribution. Use Bayes' Theorem to compute the posterior probability for each class and classify the test data.
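A possible from-scratch sketch is shown below. It assumes the features are conditionally independent within each class (diagonal covariance), which makes it a Gaussian naive Bayes; a full-covariance variant would store a covariance matrix per class instead of per-feature variances.

```python
import numpy as np

class GaussianBayesClassifier:
    """Bayes' theorem with class priors and per-feature Gaussian likelihoods.

    Assumes conditionally independent features (diagonal covariance).
    """

    def fit(self, X, y):
        X, y = np.asarray(X, dtype=float), np.asarray(y)
        self.classes_ = np.unique(y)
        self.priors_ = np.array([(y == c).mean() for c in self.classes_])
        self.means_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        # Small constant avoids division by zero for constant features.
        self.vars_ = np.array([X[y == c].var(axis=0) + 1e-9 for c in self.classes_])
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        log_posteriors = []
        for prior, mu, var in zip(self.priors_, self.means_, self.vars_):
            # log P(x | class) under independent Gaussians, plus log prior
            log_lik = -0.5 * np.sum(np.log(2 * np.pi * var) + (X - mu) ** 2 / var, axis=1)
            log_posteriors.append(np.log(prior) + log_lik)
        return self.classes_[np.argmax(np.column_stack(log_posteriors), axis=1)]
```

Usage mirrors scikit-learn's API: GaussianBayesClassifier().fit(X_train, y_train).predict(X_test).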
Other Classifiers (using scikit-learn): Apply the following classifiers from scikit-learn:
Support Vector Machine (SVM):
Use both Linear SVM and Kernel SVM (RBF and Polynomial kernels).
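A sketch of the three SVM variants; the C, gamma, and degree values are placeholder settings rather than tuned ones, and the StandardScaler step is a deliberate extra since SVMs are sensitive to feature scale.

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Placeholder hyperparameters -- tune C, gamma, and degree for your data.
svm_models = {
    "Linear SVM": make_pipeline(StandardScaler(), SVC(kernel="linear", C=1.0)),
    "RBF SVM":    make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0, gamma="scale")),
    "Poly SVM":   make_pipeline(StandardScaler(), SVC(kernel="poly", C=1.0, degree=3)),
}

for name, model in svm_models.items():
    model.fit(X_train, y_train)
    print(f"{name}: test accuracy = {model.score(X_test, y_test):.3f}")
```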
k-Nearest Neighbors (KNN):
Experiment with different values for k.
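A simple loop over a few candidate values of k (odd values avoid ties in voting); the range worth trying depends on the dataset size.

```python
from sklearn.neighbors import KNeighborsClassifier

for k in [3, 5, 7, 11]:   # illustrative values of k
    knn = KNeighborsClassifier(n_neighbors=k)
    knn.fit(X_train, y_train)
    print(f"k={k}: test accuracy = {knn.score(X_test, y_test):.3f}")
```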
Naive Bayes:
Implement Gaussian Naive Bayes using GaussianNB.
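The scikit-learn counterpart of the from-scratch classifier above:

```python
from sklearn.naive_bayes import GaussianNB

gnb = GaussianNB()
gnb.fit(X_train, y_train)
y_pred_gnb = gnb.predict(X_test)
```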
Linear Discriminant Analysis (LDA):
Use LinearDiscriminantAnalysis.
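LDA needs no special configuration for a first pass:

```python
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

lda = LinearDiscriminantAnalysis()
lda.fit(X_train, y_train)
y_pred_lda = lda.predict(X_test)
```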
Kernel Discriminant Analysis (KDA):
Implement KDA using different kernels (RBF, Polynomial).
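scikit-learn ships no KDA estimator, so the sketch below approximates it with a kernel feature map (KernelPCA) followed by LDA in that space; n_components, gamma, and degree are illustrative assumptions, and a kernel Fisher discriminant written directly from the kernel matrix is an equally valid route.

```python
from sklearn.decomposition import KernelPCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.pipeline import make_pipeline

# Approximate KDA: project with a kernel, then discriminate linearly.
# n_components, gamma, and degree are illustrative, not tuned values.
kda_models = {
    "KDA (RBF)":  make_pipeline(KernelPCA(kernel="rbf", n_components=50, gamma=0.1),
                                LinearDiscriminantAnalysis()),
    "KDA (Poly)": make_pipeline(KernelPCA(kernel="poly", n_components=50, degree=3),
                                LinearDiscriminantAnalysis()),
}

for name, model in kda_models.items():
    model.fit(X_train, y_train)
    print(f"{name}: test accuracy = {model.score(X_test, y_test):.3f}")
```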
Evaluation:
Evaluate the performance of all classifiers on the test set using the following metrics:
Accuracy, Precision, Recall, F1-Score
Generate a confusion matrix for each classifier.
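A small helper keeps the metric reporting uniform across classifiers; weighted averaging is one reasonable choice for multi-class activity labels (macro averaging is another), and y_pred_gnb refers to the Naive Bayes predictions from the earlier sketch.

```python
from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
                             precision_score, recall_score)

def evaluate(name, y_true, y_pred):
    # Weighted averages account for class imbalance across activities.
    print(name)
    print(f"  accuracy : {accuracy_score(y_true, y_pred):.3f}")
    print(f"  precision: {precision_score(y_true, y_pred, average='weighted'):.3f}")
    print(f"  recall   : {recall_score(y_true, y_pred, average='weighted'):.3f}")
    print(f"  F1-score : {f1_score(y_true, y_pred, average='weighted'):.3f}")
    print("  confusion matrix:")
    print(confusion_matrix(y_true, y_pred))

evaluate("Gaussian Naive Bayes", y_test, y_pred_gnb)
```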
Part 3: Dimensionality Reduction Using PCA
Principal Component Analysis (PCA):
Apply PCA to the dataset to reduce its dimensionality. Choose the number of components based on the explained variance ratio (e.g., retain 95% of the variance). Visualize the first two principal components.
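A sketch using the training split from Part 1. Standardizing first is an added assumption (it keeps large-range features from dominating the components), and passing a float to n_components tells scikit-learn to keep just enough components to reach that explained-variance fraction.

```python
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Fit the scaler and PCA on the training data only, then apply to both splits.
scaler = StandardScaler().fit(X_train)
X_train_std = scaler.transform(X_train)
X_test_std = scaler.transform(X_test)

pca = PCA(n_components=0.95).fit(X_train_std)   # keep 95% of the variance
X_train_pca = pca.transform(X_train_std)
X_test_pca = pca.transform(X_test_std)
print("components kept:", pca.n_components_)

# Scatter plot of the first two principal components, colored by activity class.
plt.scatter(X_train_pca[:, 0], X_train_pca[:, 1],
            c=pd.factorize(y_train)[0], s=5, cmap="tab10")
plt.xlabel("PC 1")
plt.ylabel("PC 2")
plt.show()
```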
Kernel PCA (Optional):
Experiment with Kernel PCA using different kernels (linear, RBF, polynomial) and visualize the results.
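An optional sketch that projects onto two components per kernel purely for visualization; it reuses the standardized training features from the PCA sketch above, and the gamma value is a placeholder.

```python
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.decomposition import KernelPCA

for kernel in ["linear", "rbf", "poly"]:
    kpca = KernelPCA(n_components=2, kernel=kernel, gamma=0.1)  # gamma is illustrative
    X_kpca = kpca.fit_transform(X_train_std)
    plt.figure()
    plt.scatter(X_kpca[:, 0], X_kpca[:, 1],
                c=pd.factorize(y_train)[0], s=5, cmap="tab10")
    plt.title(f"Kernel PCA ({kernel})")
plt.show()
```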
Part 4: Classification on the PCA-Reduced Dataset
Now, apply the same classifiers used in Part 2 to the PCA-reduced dataset:
Bayesian Classifier (from scratch):
Apply the Bayesian Classifier you built from scratch to the PCA-reduced data.
Other Classifiers (using scikit-learn):
Apply the following classifiers on the PCA-reduced dataset:
Linear SVM and Kernel SVM. k-Nearest Neighbors (KNN). Naive Bayes. Linear Discriminant Analysis (LDA). Kernel Discriminant Analysis (KDA).
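The sketch below reuses the PCA-transformed splits from Part 3 with default hyperparameters; the from-scratch GaussianBayesClassifier and the KDA pipeline from Part 2 can be fitted on X_train_pca in the same way.

```python
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC

# Same model zoo as Part 2, now trained on the PCA-reduced features.
models_pca = {
    "Linear SVM":  SVC(kernel="linear"),
    "RBF SVM":     SVC(kernel="rbf"),
    "KNN (k=5)":   KNeighborsClassifier(n_neighbors=5),
    "Gaussian NB": GaussianNB(),
    "LDA":         LinearDiscriminantAnalysis(),
}

results_pca = {}
for name, model in models_pca.items():
    model.fit(X_train_pca, y_train)
    results_pca[name] = model.score(X_test_pca, y_test)
    print(f"{name}: accuracy on PCA features = {results_pca[name]:.3f}")
```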
Evaluation:
Evaluate the classifiers again using the same metrics (accuracy, precision, recall, F1-score). Compare their performance with and without PCA.
Part 5: Performance Comparison
Evaluation:
Create a summary table comparing the performance of all classifiers on the entire dataset and the PCA-reduced dataset. Discuss how the classifiers performed with and without PCA.
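One way to build the summary, assuming the test accuracies were collected into two dicts keyed by classifier name (results_full and results_pca are placeholder names); the same pattern extends to precision, recall, and F1 columns.

```python
import pandas as pd

# results_full and results_pca are hypothetical dicts of
# {classifier name: test accuracy} gathered in Parts 2 and 4.
summary = pd.DataFrame({
    "Accuracy (full data)": results_full,
    "Accuracy (PCA-reduced)": results_pca,
})
print(summary.round(3))
```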
Comparison and Discussion:
Compare the Bayesian classifier built from scratch to its scikit-learn counterparts. Analyze the impact of PCA on the classifiers' performance. Discuss the effect of using different kernels in SVM and KDA.
Submission Instructions:
Submit your work in a Jupyter notebook named firstname_lastname.ipynb.
