2 Recurrent Neural Networks [20 points]
Given the LSTM structure in Figure 11 and the corresponding definition in (11).
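$$
\begin{pmatrix} i_t \\ f_t \\ o_t \\ g_t \end{pmatrix}
=
\begin{pmatrix} \sigma \\ \sigma \\ \sigma \\ \tanh \end{pmatrix}
\left[
\begin{pmatrix} W_1 \\ W_2 \\ W_3 \\ W_4 \end{pmatrix}
\begin{pmatrix} h_{t-1} \\ x_t \end{pmatrix}
\right]
$$

$$
c_t = f_t \odot c_{t-1} + i_t \odot g_t
$$

$$
h_t = o_t \odot \tanh(c_t)
$$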
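As a concrete reading of the definition, here is a minimal sketch of one forward step in the scalar case. It assumes the first three gates use the logistic sigmoid (the usual LSTM convention; the activation symbols did not survive in the source), that there are no bias terms, and that each W_k splits into a weight on h_{t-1} and a weight on x_t. The function name `lstm_step` and the `(w_h, w_x)` pair layout are illustrative choices, not part of the problem.

```python
import math


def sigmoid(z):
    """Logistic sigmoid, assumed here for the i, f and o gates."""
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(h_prev, c_prev, x, W):
    """One scalar LSTM step.

    W is a sequence of four (w_h, w_x) pairs, one per gate in the
    order i, f, o, g (a hypothetical layout for W_1..W_4).
    Returns h_t, c_t and the gate values (i_t, f_t, o_t, g_t).
    """
    (w1h, w1x), (w2h, w2x), (w3h, w3x), (w4h, w4x) = W
    i = sigmoid(w1h * h_prev + w1x * x)    # input gate
    f = sigmoid(w2h * h_prev + w2x * x)    # forget gate
    o = sigmoid(w3h * h_prev + w3x * x)    # output gate
    g = math.tanh(w4h * h_prev + w4x * x)  # candidate cell value
    c = f * c_prev + i * g                 # c_t = f_t * c_{t-1} + i_t * g_t
    h = o * math.tanh(c)                   # h_t = o_t * tanh(c_t)
    return h, c, (i, f, o, g)
```

With all weights set to zero, every sigmoid gate equals 0.5 and g_t equals 0, so the step reduces to c_t = 0.5 c_{t-1} and h_t = 0.5 tanh(c_t), which is a quick sanity check on the wiring.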
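Let the loss of an LSTM model be $L$. Assume we have calculated $\frac{\partial L}{\partial i_{t+1}}$, $\frac{\partial L}{\partial f_{t+1}}$, $\frac{\partial L}{\partial o_{t+1}}$, $\frac{\partial L}{\partial g_{t+1}}$, $\frac{\partial L}{\partial c_{t+1}}$ and $\frac{\partial L}{\partial h_{t+1}}$. Derive gradient formulas for $\frac{\partial L}{\partial i_t}$, $\frac{\partial L}{\partial o_t}$, $\frac{\partial L}{\partial g_t}$, $\frac{\partial L}{\partial c_t}$ and $\frac{\partial L}{\partial h_t}$
(you can assume all gradients are scalars).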
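
One possible way to set up the derivation is sketched below; it is an outline under stated assumptions, not the official solution. It treats every quantity as a scalar, assumes the logistic sigmoid for the i, f and o gates, writes each $W_k$ as a pair $(w_k^h, w_k^x)$ acting on $h_{t-1}$ and $x_t$ (notation introduced here), and uses $\delta_t$ for any direct contribution of the loss at $h_t$ (zero if $L$ depends on $h_t$ only through step $t+1$). All gradients are total derivatives.

Since $h_t$ feeds the four gates at step $t+1$, and $\sigma'(z) = \sigma(z)(1-\sigma(z))$, $\tanh'(z) = 1-\tanh^2(z)$:

$$
\frac{\partial L}{\partial h_t} = \delta_t
+ \frac{\partial L}{\partial i_{t+1}}\, i_{t+1}(1-i_{t+1})\, w_1^h
+ \frac{\partial L}{\partial f_{t+1}}\, f_{t+1}(1-f_{t+1})\, w_2^h
+ \frac{\partial L}{\partial o_{t+1}}\, o_{t+1}(1-o_{t+1})\, w_3^h
+ \frac{\partial L}{\partial g_{t+1}}\, (1-g_{t+1}^2)\, w_4^h
$$

The cell state $c_t$ reaches the loss through $h_t = o_t \tanh(c_t)$ and through $c_{t+1} = f_{t+1} c_t + i_{t+1} g_{t+1}$:

$$
\frac{\partial L}{\partial c_t} = \frac{\partial L}{\partial h_t}\, o_t \left(1-\tanh^2(c_t)\right) + \frac{\partial L}{\partial c_{t+1}}\, f_{t+1}
$$

The gate gradients then follow from $c_t = f_t c_{t-1} + i_t g_t$ and $h_t = o_t \tanh(c_t)$:

$$
\frac{\partial L}{\partial i_t} = \frac{\partial L}{\partial c_t}\, g_t, \qquad
\frac{\partial L}{\partial g_t} = \frac{\partial L}{\partial c_t}\, i_t, \qquad
\frac{\partial L}{\partial o_t} = \frac{\partial L}{\partial h_t}\, \tanh(c_t)
$$

By the same pattern, $\frac{\partial L}{\partial f_t} = \frac{\partial L}{\partial c_t}\, c_{t-1}$. The given $\frac{\partial L}{\partial h_{t+1}}$ does not appear explicitly because its effect is already contained in the given gate and cell gradients at step $t+1$.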