Question: Q 1 ( 6 0 pts ) : Consider a binary classification problem where we use logistic regression to predict the probability that a given
Q pts: Consider a binary classification problem where we use logistic regression to
predict the probability that a given input xi nRn belongs to class The model is defined
as:
hatysigma wTTxb
where sigma zez is the sigmoid function, winRn is the weight vector, and b is the
bias term.
The loss function used to train the model is the crossentropy loss, defined for a single
training example xy as:
Lwbyloghatyyloghaty
Given a training dataset xiyiim the overall loss is the average crossentropy loss:
Jwbmsumim Lwb;xiyi
Questions:
a Derive the gradients of the loss function Jwb with respect to the parameters w and
b Show all steps in your derivation. points
b Write the update equations for w and b using gradient descent. Assume the learning
rate is alpha points
c Suppose you have a dataset with three training examples and the current values of the
parameters are w and b The learning rate alpha is set to Given
the following dataset:
Calculate the gradients and update the parameters w and b after one step of gradient
descent. points
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
