Question: In this programming assignment, please write code from scratch (do not use preexisting libraries from
anywhere) for (1) linear models and (2) the gradient descent algorithm.
Task 1. Classifying MNIST data
Use the MNIST data provided along with this assignment. There are two files: MNIST_training_HW1.csv
and MNIST_test_HW1.csv. In those two datasets, there are 95 and 50 samples respectively for each
label of 0 and 1. Consequently, we will solve a binary classification problem.
You will train a linear regression model using the training data (MNIST_training_HW1.csv) and will
compute accuracy with the test data (MNIST_test_HW1.csv).
For Task 1, please follow the procedure:
1. Train a linear regression model
Find the optimal parameters (\theta) by the Normal Equation: \theta = (X^T X)^{-1} X^T y. Computing this
directly will cause an error/warning because X^T X is not invertible here.
You can use numpy.linalg.pinv (or pinv from MATLAB) for the matrix inverse. The function
computes the Moore-Penrose pseudoinverse X^+, which will further be used to find valid optimal
parameters using \theta^* = X^+ y. This will be a solution to the rank-deficient (degenerate) system.
2. Display the optimal coefficients (denoted by \theta^*)
3. Classify the (binary) test data (MNIST_test_HW1.csv) with a threshold of 0.5 as described below:
y_pred = X_test \theta^*
if y_pred > 0.5, predict class 1; otherwise class 0
4. Display the % accuracy. (A minimal code sketch for steps 1-4 appears after this list.)
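As a concrete illustration of steps 1-4, here is a minimal Python sketch. It assumes the CSV files have no header row and store the 0/1 label in the first column with the pixel values after it; adjust the loading code if the layout differs.

import numpy as np

def load_mnist_csv(path):
    # Assumption: no header row, label in column 0, pixel features afterwards.
    data = np.loadtxt(path, delimiter=",")
    y = data[:, 0]
    X = data[:, 1:]
    # Prepend a bias (intercept) column of ones.
    return np.hstack([np.ones((X.shape[0], 1)), X]), y

X_train, y_train = load_mnist_csv("MNIST_training_HW1.csv")
X_test, y_test = load_mnist_csv("MNIST_test_HW1.csv")

# Step 1: normal equation via the Moore-Penrose pseudoinverse,
# theta* = X^+ y (a plain inverse of X^T X would fail: rank-deficient).
theta_star = np.linalg.pinv(X_train) @ y_train

# Step 2: display the optimal coefficients.
print("theta*:", theta_star)

# Steps 3-4: classify with the 0.5 threshold and report % accuracy.
y_pred = (X_test @ theta_star > 0.5).astype(int)
print("Accuracy: {:.2f}%".format(np.mean(y_pred == y_test) * 100))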
Task 2. Implementation of Gradient Descent with MNIST data
For Task 2, we will use the same data as Task 1. However, we will find the optimal coefficients by using the
"Gradient Descent" algorithm and then compare them with the solution that we found in Task 1.
The procedure of Task 2 is almost the same as Task 1, but you need to implement the "Gradient Descent"
algorithm, instead of a least-squares single-line solution such as (X^T X)^{-1} X^T y or,
more appropriately, pinv.
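For reference, assuming the standard squared-error cost (the assignment does not spell out the cost function; this is what the learning curve in step 3 would plot), each gradient-descent iteration uses:

J(\theta) = \frac{1}{2m} \lVert X\theta - y \rVert^2, \qquad \nabla J(\theta) = \frac{1}{m} X^T (X\theta - y), \qquad \theta \leftarrow \theta - \alpha \nabla J(\theta)

where m is the number of training samples.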
For the Gradient Descent algorithm, please follow the procedure:
1. Set the initial coefficients to zeros (they can be any random values, though)
Think about what the dimension of the coefficient vector should be.
2. Determine hyperparameters such as the learning rate (\alpha) and the number of iterations (k).
3. Run "gradient descent" algorithm with the hyperparameters and check "Learning Curve" as
shown:
* Learning curve shows whether it converges or not. Xaxis shows the number of iterations,
while yaxis shows cost (J).
* Learning curve must be showing as "converged", otherwise the solution may not be good.
4. Display the estimated coefficients (denoted by \hat{\theta})
5. Classify the test data (MNIST_test_HW1.csv) with a threshold of 0.5 as described below:
y_pred = X_test \hat{\theta}
if y_pred > 0.5, predict class 1; otherwise class 0
6. Display the % accuracy.
7. Display the aggregate difference between \theta^* and \hat{\theta}, i.e., the sum of the absolute
element-wise differences \sum_j |\theta^*_j - \hat{\theta}_j|. (A gradient-descent code sketch covering these
steps appears after this list.)
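Here is a minimal Python sketch of the gradient-descent procedure, reusing X_train, y_train, X_test, y_test, and theta_star from the Task 1 sketch. The squared-error cost above is assumed, and alpha = 1e-6 with k = 1000 iterations are only placeholder hyperparameters; with raw 0-255 pixel values, a small learning rate or feature scaling is usually needed for the learning curve to converge.

import numpy as np
import matplotlib.pyplot as plt

def gradient_descent(X, y, alpha, k):
    m, n = X.shape
    theta = np.zeros(n)            # step 1: start from all-zero coefficients
    costs = []
    for _ in range(k):
        error = X @ theta - y
        costs.append(error @ error / (2 * m))      # cost J(theta) before the update
        theta = theta - alpha * (X.T @ error) / m  # gradient step
    return theta, costs

# Step 2: placeholder hyperparameters -- tune them until the learning curve flattens.
alpha, k = 1e-6, 1000
theta_hat, costs = gradient_descent(X_train, y_train, alpha, k)

# Step 3: learning curve (cost J versus iteration number).
plt.plot(costs)
plt.xlabel("iteration")
plt.ylabel("cost J")
plt.title("Learning curve")
plt.show()

# Step 4: estimated coefficients.
print("theta_hat:", theta_hat)

# Steps 5-6: classify the test data and report % accuracy.
y_pred = (X_test @ theta_hat > 0.5).astype(int)
print("Accuracy: {:.2f}%".format(np.mean(y_pred == y_test) * 100))

# Step 7: aggregate difference between theta* and theta_hat.
print("Aggregate difference:", np.sum(np.abs(theta_star - theta_hat)))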
Submission:
Please submit the following to the D2L by the stipulated deadline therein:
1. Summary: an MS Word or PDF file summarizing your results and discussion only.
Describe what you did, and how, as well as your important results.
2. Code: one original, executable Python/MATLAB/Octave/R/Java/C++ source code file, or a zip file if
there are multiple code files, with a clear ReadMe explaining how to run it.
The code must be well organized (comments, indentation, ...).
3. Code PDF: also upload the PDF version of the original source code file(s), in case we would just like
to have a look at the code but don't want to run it.
Please submit these THREE files SEPARATELY. DO NOT compress them into a ZIP file.
