Question: Let's consider a CNN based architecture designed to classify an image into one of the three classes - a pedestrian, a tree or a traffic

Let's consider a CNN based architecture designed to classify an image into one of the three classes - a pedestrian, a tree or a traffic signal. Each input image is of size (512,512,3)(RGB).
The network contains the following 11 layers in order. Note that we will address the input layer as the first layer, the next conv layer as the second layer and so on (i.e. according to the numbers).
1. Input image (512,512,3)
2. Convolution: 325x5 filters, stride 's1', padding 'p1'
3. Convolution: 323x3 filters, stride 1, padding 1
4, Max Pooling: 2x2 filter, stride 2
5. Convolution: 643x3 filters, stride 1, padding 1
6. Convolution: 643x3 filters, stride 1, padding 1
7. Max Pooling: 2x2 filter, stride 2
8. Layer 'l'
9. Fully-connected: 4096 neurons
10. Fully-connected: 512 neurons
11. Fully-connected: 'F' neurons
Question 2/5
Mandatory
Comprehension
The 8th layer is named layer 'l'. Which of the following types could be the layer 'l'?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!