Question: In binary classification, gradient descent is used to find the minimum of the error function. With error function defined as the binary cross entropy (

In binary classification, gradient descent is used to find the minimum of the error function. With error function defined as the binary cross entropy (:(1-rt)log(1-yt)} and yt=tanh(wTxt+w0), derive the update for wj and w0, where j0.
Note: Given y=tanh(), the derivative delydel=1-y2.
 In binary classification, gradient descent is used to find the minimum

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!