Question: Suppose that z is a vector with n elements. We would like to compute the gradient of y = softmax(z). Show that the Jacobian of
Suppose that z is a vector with n elements. We would like to compute the gradient of y = softmax(z). Show that the Jacobian of y with respect to z, J, is given by the Jij =yi/zj = yi(ij yj ) where ij is the Dirac delta, i.e., 1 if i = j and 0 else.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
