Question: In this project, you will design two classifiers: a naive Bayes classifier, and a perceptron classifier. You will test your classifiers on two image data

In this project, you will design two classifiers: a naive Bayes classifier, and a perceptron classifier. You will test your classifiers on two image data sets: a set of scanned handwritten digit images and a set of face images in which edges have already been detected. Even with simple features, your classifiers will be able to do quite well on these tasks when given enough training data.

Optical character recognition

(

OCR

)

is the task of extracting text from image sources. The first data set on which you will run your classifiers is a collection of handwritten numerical digits

(0 - 9) .

This is a very commercially useful technology, similar to the technique used by the US post office to route mail by zip codes. There are systems that can perform with over

99 %

classification accuracy

(

see LeNet

- 5

for an example system in action

) .

Face detection is the task of localizing faces within video or still images. The faces can be at any location and vary in size. There are many applications for face detection, including human computer interaction and surveillance. You will attempt a simplified face detection task in which your system is presented with an image that has been pre

-

processed by an edge detection algorithm. The task is to determine whether the edge image is a face or not.

0123456789

0223456789

0123456789

Which Digit?

What you should do:

Implement two classification algorithms for detecting faces and classifying digits:

Figure

1

: Examples of the data points in the data set.

(

)

Naive Bayes Classifier

(

)

Perceptron

2 .

Design the features for each of the two problems, and write a program for extracting the features from each image.

3 .

Train the algorithms on the part of the data set that is reserved for training. First, use only

10 %

of the data points that are reserved for training, then

20 %, 30 %, 40 %, 50 %, 60 %, 70 %, 80 %, 90 %,

and finally

100 % .

All the results should a function of the number of data points used for training.

4 .

Compare the performances of the two algorithms using the part of the data set that is reserved for testing, and report:

The time needed for training as a function of the number of data points used for training.

The prediction error

(

and standard deviation

)

as a function of the number of data points used for training.

Write a report describing the implemented algorithms and discussing the results and the learned lessons.

In this project, you will design two classifiers:

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Project Description: In this project, you will design two classifiers: a naive Bayes classifier, and a perceptron classifier. You will test your classifiers on two image data sets: a set of scanned...

(REALLY NEED HELP CREATING THIS CODE IN FULL AND ITS COMPLETE ENTIRETY... ALL OF THE DETAILS ARE PROVIDED AND THE CODE SHOULD HAVE EACH PART FOR EACH QUESTION LABELED SEPARATELY... PLEASE HELP ME AND...

Assignment for module 6 In this assignment, you are required to implement a document classifier using Nave Bayes algorithm with your favorite programming language. You will use the provided training...

Please HELP ! This is a python programming question: Please read the instructions well. Give your own answer. Put a screenshot of the code you made. The assignment needs to be understood. Because of...

This question concerns a portrayal of English language messages outlined by a maker. (a) When investigating a text we are stressed over its sorts and tokens. (I) What are the sorts and unmistakable...

Assignment Exercise 2: PART A: Q1: Using a data set of your choice, do the following. Remember that the only pre-requisite before setting about with Naive classification is to have an existing set of...

Implement in C + + 1 1 a Naive Bayes Classifier that classifies individuals as Democrats or Republicans, using the 1 6 attributes and two classes from the Congressional Voting Records dataset Some of...

Implement a Naive Bayes Classifier that classifies individuals as Democrats or Republicans, using the 1 6 attributes and two classes from the Congressional Voting Records dataset Some of the features...

Please help me implement ( In C + + ) a Naive Bayes Classifier to classify individuals as either Democrats or Republicans using the 1 6 attributes and two classes from the Congressional Voting...

io (a) Give the general formula for estimating transition probabilities from training data. Provide the full transition matrix A for this HMM based on the training data shown. [6 marks] (b) Give the...

What is the ISO 27000 series of standards? Which individual standards make up the series?

Amber, a publicly held corporation, currently pays its president an annual salary of $900,000. In addition, it contributes $20,000 annually to a defined contribution pension plan for him. As a means...

A bond has a duration of eight years. If market interest rates rise by 1 % , the percentage price change of the bond is approximately A ) a 1 % decline. B ) an 8 % decline. C ) a 1 % increase. D ) an...

What is included in the Inspect and Adapt agenda? Quantitative and qualitative measurement System Demo ART Backlog refinement Management review and confidence vote