Question: Can you answer part 2 and part 3 multiple choice parts of this question in the image? Thank you Given a temporal CNN with one

Can you answer part 2 and part 3 multiple choice parts of this question in the image? Thank you
Given a temporal CNN with one hidden layer. The state at time t is calculated as:
St=U0xt-1+U1xt
And the output at time t is calculated as:
yt=V0St-1+V1St
Q1
Train this temporal CNN with the squared error loss function:
et=12(hat(y)t-yt)2
on the dataset:
x0=1,y0=3,x1=2,y1=6,x2=3,y2=9
Assume that U0=U1=V0=V1=1. At time t=2 what is the gradient of the loss function e2, evaluated
at the assumed values, with respect to U0? Please round your answer to one decimal place.
Q2
Let's now train multiple layers of temporal convolutions (10 in total) with dilation on a large dataset where each
data point has T=1000 timesteps. All temporal convolutions are of size 2 and dilated with 2k for layers
k=0dots9. How many matrix multiplications does the gradient go through from lT to x0? Consider the most
efficient implementation.
logT
T
T2
T2
Q3
Let's now dilate slightly differently. All temporal convolutions are of size 2 and dilated with k+1 for layers
k=0dots. We increase the number of layers until the receptive field is larger than 1000. How many matrix
multiplications does the gradient go through from lT to x0? Consider the most efficient implementation.
logT
T
T2
T2
Can you answer part 2 and part 3 multiple choice

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!