Question: Question 1 ( 6 0 % ) MUST have comprehensive code, thorough maths with explanations. In this exercise, you will train many multilayer perceptrons (

Question

1 (60 %)

MUST have comprehensive code, thorough maths with explanations.

In this exercise, you will train many multilayer perceptrons

(

MLP

)

to approximate the class

label posteriors, using maximum likelihood parameter estimation

(

equivalently

,

with minimum

average cross

-

entropy loss

)

to train the MLP

.

Then, you will use the trained models to approximate

a MAP classification rule in an attempt to achieve minimum probability of error

(

.

.

to minimize

expected loss with

0 - 1

loss assignments to correct

-

incorrect decisions

) .

Data Distribution: For C

= 4

classes with uniform priors, specify Gaussian class

-

conditional

pdfs for a

3 -

dimensional real

-

valued random vector x

(

pick your own mean vectors and covariance

matrices for each class

) .

Try to adjust the parameters of the data distribution so that the MAP

classifier that uses the true data pdf achieves between

10 % - 20 %

probability of error.

MLP Structure: Use a

2 -

layer MLP

(

one hidden layer of perceptrons

)

that has P perceptrons

in the first

(

hidden

)

layer with smooth

-

ramp style activation functions

(

.

.,

ISRU, Smooth

-

ReLU,

ELU, etc

) .

At the second

/

output layer use a softmax function to ensure all outputs are positive

and add up to

1 .

The best number of perceptrons for your custom problem will be selected using

cross

-

validation.

Generate Data: Using your specified data distribution, generate multiple datasets: Training

datasets with

100, 500, 1000, 5000, 10000

samples and a test dataset with

100000

samples. You

will use the test dataset only for performance evaluation.

Theoretically Optimal Classifier: Using the knowledge of your true data pdf

,

construct the

minimum

-

probability

-

-

error classification rule, apply it on the test dataset, and empirically esti

-

mate the probability of error for this theoretically optimal classifier. This provides the aspirational

performance level for the MLP classfier.

Model Order Selection: For each of the training sets with different number of samples,

perform

10 -

fold cross

-

validation, using minimum classification error probability as the objective

function, to select the best number of perceptrons

(

that is justified by available training data

) .

Model Training: For each training set, having identified the best number of perceptrons using

cross

-

validation, using maximum likelihood parameter estimation

(

minimum cross

-

entropy loss

)

train an MLP using each training set with as many perceptrons as you have identified as optimal

for that training set. These are your final trained MLP models for class posteriors

(

possibly each

with different number of perceptrons and different weights

) .

Make sure to mitigate the chances

of getting stuck at a local optimum by randomly reinitializing each MLP training routine multiple

times and getting the highest training

-

data log

-

likelihood solution you encounter.

Performance Assessment: Using each trained MLP as a model for class posteriors, and using

the MAP decision rule

(

aiming to minimize the probability of error

)

classify the samples in the test

set and for each trained MLP empirically estimate the probability of error.

Report Process and Results: Describe your process of developing the solution; numerically

and visually report the test set empirical probability of error estimates for the theoretically opti

-

mal and multiple trained MLP classifiers. For instance show a plot of the e mpirically estimated

test P

(

error

)

for each trained MLP versus number of training samples used in optimizing it

(

with

semilog

-

x axis

),

as well as a horizontal line that runs across the plot indicating the empirically

estimated test P

(

error

)

for the theoretically optimal classifier.

Note: You may use software packages for all aspects of your implementation. Make sure you

use tools correctly. Explain in your report how you ensured the software tools do exactly what you

need them to do

.

Question 1 ( 6 0 % ) MUST have comprehensive

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

I have an assignment where you need an experiment design study from Golub study. (NEED TO USE R PROGRAMMING). The script of the R program for the assignment is below. The assignment is attached...

Journal of Autism and Developmental Disorders, Vol. 32, No. 3, June 2002 ( 2002) Descriptive Epidemiology of Autism in a California Population: Who Is at Risk? Lisa A. Croen,1,3 Judith K. Grether,1...

BMGT 488 Assignment Grading Rubrics Project Descriptions Always submit your best, most thoughtful work. Assignments should be well organized and should demonstrate the level of writing expected of...

ECON 2035 (Online 2nd Fall 2022) Money/Bank (1) Rebecca Odle 11/11/221:50 PM C?) _ Homework: Chapter10 Question list 0 Question 1 0 Question 2 0 Question 3 0 Question 4 0 Question 5 0 Question 6 0...

The functions f and g are defined by the following tables. g(x) Question list K x f ( x ) X Use the tables to evaluate the given composite function. 2 -5 -6 f(g(7)) 4 0 2 O Question 9 5 5 2 -5 2 O...

In Midterm Exam X X C O & https://courses.yorkvilleu.ca/mod/quiz/attempt.php?attempt=240698&page=1 Close Sidebar B74 MATH0910-22F-O-B E YORKVILLE Q AskYU MyYU Participants Competencies MATH 0910:...

Grade = 100.00 Question Worth Points Lost 1a 1b 1c 1d 1e 1f 1g 1h 1i 2a 2b 2c 2d 2e 2f 2g 3a 3b 3c Total 4 6 4 6 2 4 6 6 12 4 6 6 4 4 6 4 6 4 6 100 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00...

What are the excel formulas for Excel question #2-4 (Bordered in RED)? A B D E F G 1 J K L M N 0 Q R S T U V W 1 Male Female Total 2 Email ID Time of Day Offer S1 S2 S3 Delivered Opened Open Rate...

X ? Homework: Homework 3(new) Question 1, 7.1.35 HW Score: 0%, 0 of 20 points O Points: 0 of 1 Save Question list K Find the indicated value of the function f of a single variable. f(y) = M(y,y) for...

A manager of a manufacturing company is considering adding either a drilling machine or a knurling machine. The life cycle of the drilling machine follows a uniform distribution with parameters (2.1,...

The equation of the tangent plane to the surface In (2) = 2(x - 2y) + 32 +3 at (4, 2, -1) is (a) -3x+6y - 12z - 12 = 0 (b) 3r6y-12-12-0 (c) 3x6y + 12z - 12 = 0. (d) -3x - 6y + 12% + 12 = 0

27. Bailey Company (buyer) and Suarez, Inc. (seller), engaged in the following transactions during January 20X1: INSTRUCTIONS 1. Open the accounts payable ledger account and accounts receivable...

A new sweater brand is trying to decide on a positioning. They want to use the laddering technique and position their brand at a high abstract value level. Which of the following positioning strategie