Question: In Exercise 12 above, we train the multi-logit classifier using a weight matrix (mathbf{W}) (in mathbb{R}^{3 times 7}) and bias vector (boldsymbol{b} in mathbb{R}^{3}). Repeat

In Exercise 12 above, we train the multi-logit classifier using a weight matrix \(\mathbf{W}\) \(\in \mathbb{R}^{3 \times 7}\) and bias vector \(\boldsymbol{b} \in \mathbb{R}^{3}\). Repeat the training of the multi-logit model, but this time keeping \(z_{1}\) as an arbitrary constant (say \(z_{1}=0\) ), and thus setting \(c=0\) to be a "reference" class. This has the effect of removing a node from the output layer of the network, giving a weight matrix \(\mathbf{W} \in \mathbb{R}^{2 \times 7}\) and bias vector \(\mathrm{b} \in \mathbb{R}^{2}\) of smaller dimensions than in (7.16).

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

We modify the softm... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Statistical Techniques in Business Questions!