Question: Consider the training dataset D = { ( x 1 , y 1 ) , . . . , ( xn , yn ) }

Consider the training dataset D ={(x1, y1),...,(xn, yn)} with n samples where xi in Rd1, yi in R for
all i in [n]. Let H in Rd2\times d1 be a fixed matrix, and let s : Rd2-> Rd2 be the softmax function, that ezi
is ith output s(z)i := Pd2 ezi . A sample xi from input dataset, goes through the transformations i=1
depicted in Figure 1. And our prediction yi = zi \theta . As a result, for a single sample (xi, yi), we can construct a regression loss function as:

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!