Please help with this deep learning assignment!! CODE IN PYTHON.
2 Training Effects: Activation Functions, Optimizers, Batch Size
Problem 5: For the model structure you chose in the earlier problem, consider the problem of training, but with different batch sizes. Should you train with a small batch size, or a large batch size? Experiment with ten batch sizes, ranging from small to something sufficiently large, and plot training loss, testing loss, and clock time as functions of training time. What do you notice, and what does this say about a good choice of batch size?
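A minimal sketch of such a sweep, assuming PyTorch with random stand-in data (swap in your actual dataset, e.g. MNIST, and the dense model from the earlier problem); the batch sizes, layer widths, and epoch count here are placeholders:

```python
import time
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Stand-in data; replace with the real dataset from the earlier problem.
X_train, y_train = torch.randn(10_000, 784), torch.randint(0, 10, (10_000,))
X_test, y_test = torch.randn(2_000, 784), torch.randint(0, 10, (2_000,))

def make_model():
    # Placeholder for the dense model chosen in the earlier problem.
    return nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))

batch_sizes = [1, 2, 4, 8, 16, 32, 64, 128, 256, 512]  # ten sizes, small to large
loss_fn = nn.CrossEntropyLoss()
results = {}

for bs in batch_sizes:
    model = make_model()
    opt = torch.optim.SGD(model.parameters(), lr=0.01)
    loader = DataLoader(TensorDataset(X_train, y_train), batch_size=bs, shuffle=True)
    start = time.perf_counter()
    for _ in range(5):  # fixed epoch budget so runs are comparable
        for xb, yb in loader:
            opt.zero_grad()
            loss_fn(model(xb), yb).backward()
            opt.step()
    elapsed = time.perf_counter() - start
    with torch.no_grad():
        results[bs] = (loss_fn(model(X_train), y_train).item(),
                       loss_fn(model(X_test), y_test).item(),
                       elapsed)
    print(f"bs={bs:4d} train={results[bs][0]:.3f} "
          f"test={results[bs][1]:.3f} time={elapsed:.1f}s")
```

Recording the losses per epoch rather than only at the end would give the curves to plot against training time.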
Problem 6: Repeat Problem 5, but comparing Adam optimization vs. SGD optimization. What are the tradeoffs? What determines a good step size?
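A sketch of how the comparison might be wired up, again assuming PyTorch and stand-in data; the step-size grids are illustrative guesses, not recommendations:

```python
import time
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Stand-in data; use the same dataset and model as in Problem 5.
X, y = torch.randn(10_000, 784), torch.randint(0, 10, (10_000,))
loader = DataLoader(TensorDataset(X, y), batch_size=64, shuffle=True)
loss_fn = nn.CrossEntropyLoss()

# Illustrative step-size grids: Adam's adaptive scaling usually works near its
# 1e-3 default, while plain SGD often wants a larger, hand-tuned learning rate.
configs = [("sgd", lr) for lr in (1e-3, 1e-2, 1e-1)] + \
          [("adam", lr) for lr in (1e-4, 1e-3, 1e-2)]

for name, lr in configs:
    model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
    opt = (torch.optim.SGD if name == "sgd" else torch.optim.Adam)(
        model.parameters(), lr=lr)
    start = time.perf_counter()
    for _ in range(3):
        for xb, yb in loader:
            opt.zero_grad()
            loss_fn(model(xb), yb).backward()
            opt.step()
    with torch.no_grad():
        final = loss_fn(model(X), y).item()
    print(f"{name:4s} lr={lr:.0e} loss={final:.3f} "
          f"time={time.perf_counter() - start:.1f}s")
```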
Problem 7: Consider Problem 5 again, but change the underlying activation function of the model, in particular sigmoid vs. tanh vs. ReLU vs. ELU. Can you draw any conclusions?
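One way to structure that experiment, holding everything fixed except the nonlinearity and reusing the training harness sketched above:

```python
import torch.nn as nn

# Swap only the activation; architecture, optimizer, and data stay fixed.
activations = {"sigmoid": nn.Sigmoid, "tanh": nn.Tanh, "relu": nn.ReLU, "elu": nn.ELU}

def make_model(act):
    return nn.Sequential(nn.Linear(784, 128), act(), nn.Linear(128, 10))

for name, act in activations.items():
    model = make_model(act)
    # ...train with the same harness as above and record loss curves per name
```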
Bonus: What changes if you run these same experiments on a GPU?
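For the bonus, the main code changes are device placement and careful timing; a sketch, again assuming PyTorch:

```python
import torch
import torch.nn as nn

# Select the GPU when present; the same script then runs on either device.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10)).to(device)
xb = torch.randn(256, 784, device=device)
out = model(xb)

# CUDA kernels launch asynchronously, so flush them before reading a timer.
if device.type == "cuda":
    torch.cuda.synchronize()
print(out.shape, device)
```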
3 CNNs vs Dense Layers
Problem 8: Consider again the model you built in the earlier problem. This was a relatively simple model with vanilla dense layers. Consider constructing a simple CNN model in the following way (a minimal sketch follows the list):
1. Pass the input image into a convolutional layer and an activation function.
2. Flatten the result.
3. Pass the flattened result into a number of dense layers and activation functions.
4. Pass the result through a softmax layer to get class probabilities.
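A minimal sketch of that recipe in PyTorch; the kernel size, filter count, and hidden width are placeholders, and 28x28 grayscale inputs (e.g. MNIST) are assumed:

```python
import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    # Conv layer -> activation -> flatten -> dense layers -> softmax,
    # exactly the four steps above.
    def __init__(self, n_filters=8, kernel_size=3, hidden=32, n_classes=10):
        super().__init__()
        self.conv = nn.Conv2d(1, n_filters, kernel_size)
        self.act = nn.ReLU()
        self.flatten = nn.Flatten()
        flat = n_filters * (28 - kernel_size + 1) ** 2  # 28x28 input, no padding
        self.dense = nn.Sequential(nn.Linear(flat, hidden), nn.ReLU(),
                                   nn.Linear(hidden, n_classes))

    def forward(self, x):
        logits = self.dense(self.flatten(self.act(self.conv(x))))
        # Softmax gives class probabilities; with nn.CrossEntropyLoss you
        # would train on the logits and drop this softmax.
        return torch.softmax(logits, dim=1)

model = SmallCNN()
print(sum(p.numel() for p in model.parameters()), "parameters")
```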
With a single convolutional layer (kernel size and number of filters at your discretion), find the smallest model you can, in terms of total number of parameters, that ultimately matches or exceeds the performance of the model you found in the earlier problem. How did you go about your neural architecture search to answer the question?
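One simple way to organize the search is to enumerate candidate widths, count parameters, and train only the promising ones; a sketch reusing the SmallCNN class from above (the grids are placeholders):

```python
# Enumerate candidates; training each and keeping the smallest that matches
# the dense model's accuracy completes the search (training harness omitted).
candidates = [(f, h) for f in (2, 4, 8, 16) for h in (8, 16, 32, 64)]
for n_filters, hidden in candidates:
    m = SmallCNN(n_filters=n_filters, hidden=hidden)
    print(f"filters={n_filters:2d} hidden={hidden:2d} "
          f"params={sum(p.numel() for p in m.parameters())}")
```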
For the CNN architecture you find and the original architecture from the earlier problem, plot the training and testing loss over training time for comparable batch sizes, step sizes, and optimizer (be clear about the choices you are making).
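Plotting the comparison might look like this sketch, where the histories are hypothetical (seconds, loss) pairs you would record during training; the numbers below are placeholders only:

```python
import matplotlib.pyplot as plt

# Placeholder histories; record real (elapsed_seconds, test_loss) pairs
# during training of each architecture.
dense_history = [(1, 2.1), (5, 1.4), (10, 0.9)]  # placeholder numbers
cnn_history = [(1, 1.8), (5, 0.8), (10, 0.5)]    # placeholder numbers

for label, history in [("dense", dense_history), ("cnn", cnn_history)]:
    t, loss = zip(*history)
    plt.plot(t, loss, label=label)
plt.xlabel("training time (s)")
plt.ylabel("test loss")
plt.legend()
plt.show()
```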
Problem 9: Consider Problem 8, but now you are allowed two stacked convolutional layers of different kernel sizes and filter counts. Can you beat the network from Problem 8 in terms of performance vs. parameter count?
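A sketch of the two-layer variant, with all kernel sizes and filter counts as placeholders to tune:

```python
import torch.nn as nn

class TwoConvCNN(nn.Module):
    # Two stacked conv layers with different kernel sizes, then the same
    # flatten -> dense -> output head as before (28x28 inputs assumed).
    def __init__(self, k1=5, k2=3, f1=4, f2=8, hidden=32, n_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, f1, k1), nn.ReLU(),
            nn.Conv2d(f1, f2, k2), nn.ReLU(),
            nn.Flatten())
        side = 28 - k1 + 1 - k2 + 1  # spatial size after two valid convs
        self.head = nn.Sequential(nn.Linear(f2 * side * side, hidden), nn.ReLU(),
                                  nn.Linear(hidden, n_classes))

    def forward(self, x):
        return self.head(self.features(x))  # logits; softmax for probabilities
```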
Bonus: For the dense model from the earlier problem and the CNN model from Problem 9, find instances where the models fail, incorrectly classifying the image. Are the mistakes being made reasonable, to your eye? Are the models making different kinds of mistakes?
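For this bonus, a sketch of how to surface the failures; `model`, `X_test`, and `y_test` are assumed from the sketches above, and you would plot the offending images afterward:

```python
import torch

# Collect test images a trained model misclassifies (an untrained model
# will fail almost everywhere, so train first).
model.eval()
with torch.no_grad():
    preds = model(X_test.view(-1, 1, 28, 28)).argmax(dim=1)
mistakes = (preds != y_test).nonzero(as_tuple=True)[0]
print(f"{len(mistakes)} misclassified; first indices: {mistakes[:10].tolist()}")
```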
