

Consider a convolutional neural network that is used to classify images into two classes. The structure of the network is as follows:

INPUT: 100 x 100 grayscale images.

LAYER 1: Convolutional layer with 100 5 x 5 convolutional filters.

LAYER 2: Convolutional layer with 100 5 x 5 convolutional filters.

LAYER 3: A max pooling layer that down-samples Layer 2 by a factor of 4 (from 100 x 100 -> 50 x 50).

LAYER 4: Dense layer with 100 units.

LAYER 5: Dense layer with 100 units.

LAYER 6: Single output unit.

(a) How many weights does this network have?

(b) What is stochastic about stochastic gradient descent?

(c) Assume this network is trained using stochastic gradient descent on a fixed training set. Is there a guarantee that the algorithm will converge to an optimal set of weights given enough epochs of training?

(d) As with any learning algorithm, designers of deep neural networks face a bias/variance trade-off. Given your answer to part (a), would you say that high bias or high variance is a bigger concern? How might we address the concern?

In Figure 1 we see that the ordering of the three methods with respect to mean absolute error is different from the ordering with respect to test set R^2. How can this be?
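For part (a), the count depends on details the question leaves open, so the following is only a sketch under stated assumptions: each Layer 2 filter spans all 100 channels produced by Layer 1, the convolutions keep the 100 x 100 spatial size (as the pooling description implies), and the pooled 50 x 50 x 100 volume is flattened before Layer 4. Bias terms are tallied separately because the question does not say whether they count as weights. The function name `count_parameters` is illustrative and not part of the original question.

```python
def count_parameters():
    """Return {layer: (weights, biases)} for the network described in the question.

    Assumptions (not stated in the question): filters span every input channel,
    convolutions preserve the 100 x 100 size, and the pooled volume is flattened.
    """
    layers = {}

    channels = 1  # grayscale input has a single channel
    # LAYER 1: 100 filters, each 5 x 5 x 1
    layers["conv1"] = (100 * 5 * 5 * channels, 100)

    channels = 100  # Layer 1 outputs one feature map per filter
    # LAYER 2: 100 filters, each 5 x 5 x 100
    layers["conv2"] = (100 * 5 * 5 * channels, 100)

    # LAYER 3: max pooling has no trainable parameters
    layers["pool"] = (0, 0)

    # LAYER 4: every one of the 50 x 50 x 100 pooled values feeds each of 100 units
    flattened = 50 * 50 * channels
    layers["dense1"] = (flattened * 100, 100)

    # LAYER 5: 100 inputs to 100 units
    layers["dense2"] = (100 * 100, 100)

    # LAYER 6: 100 inputs to a single output unit
    layers["output"] = (100 * 1, 1)
    return layers


layers = count_parameters()
total_weights = sum(w for w, _ in layers.values())
total_biases = sum(b for _, b in layers.values())

for name, (w, b) in layers.items():
    print(f"{name:7s} weights={w:>10,d} biases={b:>4d}")
print(f"total weights: {total_weights:,d}")               # total weights: 25,262,600
print(f"with biases:   {total_weights + total_biases:,d}")  # with biases:   25,263,001
```

Under these assumptions the first dense layer holds 25,000,000 of the roughly 25.3 million weights, which is the figure part (d) asks you to reason about.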
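For the closing question about Figure 1, one possible explanation is that mean absolute error penalises errors linearly while R^2 is built from squared errors, so a method with a few large misses can rank well on one metric and poorly on the other. Figure 1 and its data are not reproduced here, so the targets and the three "methods" A, B and C below are made-up toy values chosen only to show the effect.

```python
def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between targets and predictions."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Hypothetical data: four easy targets and one outlier.
y_true = [0, 0, 0, 0, 10]
predictions = {
    "A": [0, 0, 0, 0, 0],  # exact on four points, misses the outlier badly
    "B": [2, 2, 2, 2, 2],  # always predicts the mean of y_true
    "C": [3, 3, 3, 3, 7],  # moderately wrong everywhere
}

scores = {
    name: (mean_absolute_error(y_true, pred), r_squared(y_true, pred))
    for name, pred in predictions.items()
}

# Rank from best to worst: lower MAE is better, higher R^2 is better.
by_mae = sorted(scores, key=lambda name: scores[name][0])
by_r2 = sorted(scores, key=lambda name: scores[name][1], reverse=True)

print(by_mae)  # ['A', 'C', 'B']
print(by_r2)   # ['C', 'B', 'A']
```

Method A has the lowest mean absolute error (2.0) but the worst R^2 (-0.25), because its single error of 10 dominates the squared-error sum.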

