Question: An artificial neural network ( ANN ) , especially in its modern form, is the underpinning technology that has enabled modern AI systems. A simple
An artificial neural network ANN especially in its modern form, is the underpinning technology
that has enabled modern AI systems. A simple ANN model, denoted by FAN N x W can be
formulated as the below:
FAN N x W
m
j
w
j x for any x in d
where W wj : j m with wj in d are fitting parameters of the ANN model and
: is called an activation function. We let m for this project activation function
Common choice of the activation function can be the linear function, x : x for all x in or
the ReLU function, x : max x for all x in
We often train an NN model to fit the training data available by solving the following optimiza
tion problem:
min
W:wj
N
N
i
FAN N xi W yi
m
j
wj
where xi yi i N is a sequence of training data inputs or designspredictors xi and
labels or response variables yi Here, v for any vector v vk is the norm of v; namely,
v
k vk
While simple, this ANN can be a powerful tool for many applications such as handwriting
recognition.
Problem Statements
Implementations in a programming language of your choice are asked to solve the following problems.
Training data will be provided in separate files An example for the implementation will be written
in both Matlab and Python.
Question for Student : Assume that x x and Train the ANN by solving
using a gradient decent algorithm a constant step size. Note that is not differentiable.
So some transformationderivation is needed to equivalently represent this problem to gain
differentiability.
Question for Student : Assume that xmax x and Train the ANN by
solving using a gradient decent algorithm with a constant step size.
Question for Student : Consider a modified formulation:
min
W:wj
N
N
i
FAN N xi W yi
m
j
wj
Assume that x x and Train the ANN by solving using a Newtons method.
Here, v for any vector v vk denotes the norm of v; namely, v
k v
k
Question for Student : Assume that x x Train the ANN by solving using the grid
search method.
Question for Student : Assume that x x Train the ANN by solving using stochastic
approximation that operates under the assumption that each iteration of the algorithm can
only access one pair of sample data.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
