Question: An artificial neural network ( ANN ) , especially in its modern form, is the underpinning technology that has enabled modern AI systems. A simple

An artificial neural network

(

ANN

),

especially in its modern form, is the underpinning technology

that has enabled modern AI systems. A simple ANN model, denoted by FAN N

(

,

),

can be

formulated as the below:

FAN N

(

,

) =

= 1

(

j x

),

for any x in

(1)

where W

= (

wj : j

= 1, . . .,

)

with wj in

d are fitting parameters of the ANN model and

is called an activation function. We let m

= 20

for this project

(20

activation function

) .

Common choice of the activation function can be the linear function,

(

)

=

x for all x in

,

the ReLU function,

(

)

=

max

{0,

}

for all x in

.

We often train an NN model to fit the training data available by solving the following optimiza

-

tion problem:

min

= (

)

1

2

= 1

(

FAN N

(

,

)

) 2 +

= 1

1 (2)

where

(

,

),

= 1, . . .,

,

is a sequence of training data inputs

(

or designs

/

predictors

)

xi and

labels

(

or response variables

)

.

Here,

1

for any vector v

= (

)

is the

1 -

norm of v; namely,

1 =

|

|

While simple, this ANN can be a powerful tool for many applications such as handwriting

recognition.

1.2

Problem Statements

Implementations in a programming language of your choice are asked to solve the following problems.

Training data will be provided in separate file

(

) .

An example for the implementation will be written

in both Matlab and Python.

Question for Student

1

: Assume that

(

) =

x and

= 0.01 .

Train the ANN by solving

(2)

using a gradient decent algorithm a constant step size. Note that

1

is not differentiable.

,

some transformation

/

derivation is needed to equivalently represent this problem to gain

differentiability.

Question for Student

2

: Assume that

(

) = (

max

{0,

}) 3

and

= 0 .

Train the ANN by

solving

(2)

using a gradient decent algorithm with a constant step size.

Question for Student

3

: Consider a modified formulation:

min

= (

)

1

2

= 1

(

FAN N

(

,

)

) 2 +

= 1

2

2 (3)

Assume that

(

) =

x and

= 0.3 .

Train the ANN by solving

(3)

using a Newton

s method.

Here,

2

for any vector v

= (

)

denotes the

2 -

norm of v; namely,

2 =

k v

2

.

1

Question for Student

4

: Assume that

(

) =

.

Train the ANN by solving

(2)

using the grid

search method.

Question for Student

5

: Assume that

(

) =

.

Train the ANN by solving

(2)

using stochastic

approximation that operates under the assumption that each iteration of the algorithm can

only access one pair of sample data.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

ITM 309: Business Information Technology and Systems Spring 2016 Watson and the new era of cognitive systems Jerry Haan IBM Cloud Ecosystem Development January 27, 2016 2013 International Business...

* write a Matlab code: Automatic Modulation Classification ( AMC ) plays a critical role in modern communication systems, enabling efficient signal processing and adaptation in dynamic environments....

URN 09/1026 DIGITAL BRITAIN Final Report JUNE 2009 DIGITAL BRITAIN - Final Report Published by TSO (The Stationery Office) and available from: Online www.tsoshop.co.uk Mail,Telephone, Fax & E-Mail...

Assistive technology enables dreams. Mathew Lee (personal communication) Assistive technology (AT) provides powerful tools used to diminish disability, enable activities of daily living (ADLs), and...

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

1. Using examples from the case study critically examine the 5 forces which establish the rivalry in the global pharmaceutical industry. 2. Using examples from the case study critically examine the...

Which of the following describes the concept of 'many to one' relation in the context of circuits and functions? a . Only one circuit can produce a particular desired input - output function b . Many...

Answer the simulation questions 2-2 and 2-4 Simulation Question 2-4 This simulation question is based upon a true set of facts. The information contained in the simulation question was obtained from...

Could you please explain the findings of the study? A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models Evangelia...

/ / mww _ linkedin.com / learning / paths / career - essentials - in - generative - ai - by - microsoft - and - linkedin / summative - exams / urn:lila _ assessmentV 2 : 5 9 6 0 9 3 1 5 ? u = 8 3 0 7...

Chinese workers earn only $.75 an hour; if we allow China to export as much as it likes, our workers will be forced down to the same level. You cant import a $10 shirt without importing the $.75 wage...

A 2.0-kg mass resting on a horizontal frictionless surface is connected to a fixed spring. The mass is displaced 16 cm from its equilibrium position and released. At t = 0.50s, the mass is 8.0 cm...

Simple average is a special case of WAVG. What would be the special case and why?

O and RT are parallel lines 0 R N P S ZRSU and ZTSU Q Which angles are adjacent angles ZRSU and ZTSP T ZRSU and ZOPS ZRSU and ZOPN