Question: Project Description: In this project, you will design two classifiers: a naive Bayes classifier, and a perceptron classifier. You will test your classifiers on two

Project Description:

In this project, you will design two classifiers: a naive Bayes classifier, and a perceptron classifier. You will test your classifiers on two image data sets: a set of scanned handwritten digit images and a set of face images in which edges have already been detected. Even with simple features, your classifiers will be able to do quite well on these tasks when given enough training data.

Optical character recognition

(

OCR

)

is the task of extracting text from image sources. The first data set on which you will run your classifiers is a collection of handwritten numerical digits

(0 - 9) .

This is a very commercially useful technology, similar to the technique used by the US post office to route mail by zip codes. There are systems that can perform with over

99 %

classification accuracy

(

see LeNet

- 5

for an example system in action

) .

Face detection is the task of localizing faces within video or still images. The faces can be at any location and vary in size. There are many applications for face detection, including human computer interaction and surveillance. You will attempt a simplified face detection task in which your system is presented with an image that has been pre

-

processed by an edge detection algorithm. The task is to determine whether the edge image is a face or not.

0123456789

0223456789

0123456789

Which Digit?

What you should do:

Implement two classification algorithms for detecting faces and classifying digits:

Which Digit?

Face or not face?

Figure

1

: Examples of the data points in the data set.

(

)

Naive Bayes Classifier

(

)

Perceptron

2 .

Design the features for each of the two problems, and write a program for extracting the features from each image.

3 .

Train the algorithms on the part of the data set that is reserved for training. First, use only

10 %

of the data points that are reserved for training, then

20 %, 30 %, 40 %, 50 %, 60 %, 70 %, 80 %, 90 %,

and finally

100 % .

All the results should a function of the number of data points used for training.

4 .

Compare the performances of the two algorithms using the part of the data set that is reserved for testing, and report:

The time needed for training as a function of the number of data points used for training.

The prediction error

(

and standard deviation

)

as a function of the number of data points used for training.

Write a report describing the implemented algorithms and discussing the results and the learned lessons.

Please keep in mind that:

You should implement yourself these two algorithms as well as the feature extraction part.

Your algorithm should not look at the testing data before the training is over. If you use any testing data point for training, that would be considered as cheating.

Project Description: In this project, you will

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

In this project, you will design two classifiers: a naive Bayes classifier, and a perceptron classifier. You will test your classifiers on two image data sets: a set of scanned handwritten digit...

Assignment for module 6 In this assignment, you are required to implement a document classifier using Nave Bayes algorithm with your favorite programming language. You will use the provided training...

Please HELP ! This is a python programming question: Please read the instructions well. Give your own answer. Put a screenshot of the code you made. The assignment needs to be understood. Because of...

What is the best conclusion of this whole output, work, code, analysis, and articulation? Unit 8 Individual Project John Xian May 28, 2025 setwd("~/Documents/JerryS/") library(data.table) airline_data

Data Science, Python, Jupyter Notebook I have a term project for my Capstone class in Data Science. Below is the syllabus, dataset, and the Jupiter Notebook. I am creating a Classification model to...

Developments in Technology Light is incident from air on the end face of a multimode optical fibre at angle of incidence as shown below. n n 1 2 The refractive indices of the core and cladding are...

io (a) Give the general formula for estimating transition probabilities from training data. Provide the full transition matrix A for this HMM based on the training data shown. [6 marks] (b) Give the...

Linear classification Assume that we wish to classify a data vector y i n R 2 into two classes. For this purpose, you design two classifiers, namely linear classifier with least square and logistic...

3.11 Exis 191 Table 3.7. Comparing the test c ry of decision trees Trand T1 Accuracy Data Set TOT 0.86 0.97 0.77 0.84 12. Consider a labeled data set containing 100 data instances, which is randomly...

You can use any software to plot and/or to calculate values/data, but if you do, provide (copy/paste) here the code. Data sets relevant for this HW can be found at the UCI Machine Learning...

Individual A is about to acquire 30% of the shares of a new corporation (a Canadian-controlled private corporation) that will carry on an active business. The remaining 70% of the shares will be...

The financial statements of Apple Inc. are presented in Appendix A. Instructions for accessing and using the companys complete annual report, including the notes to the financial statements are also...

In the current year, the not - for - profit organization Save the Butterflies Foundation received cash of $ 5 0 0 to be used as the Foundation wishes and $ 1 , 0 0 0 to be used for butterfly...

Q1. Evaluate each definite integral. Give exact answers. 1) Jo e5x dx 5 2 dx 3) S2 2sin(2x) dx 4) J-271 2# (3sec2 x - 3tan2x) dx 5) f* excosix+xsin x dx 6) " (sinx + cosx)2dx dx Q2. Show that S =...