Question: L 2 regularization: J R ( w ) = J ( w ; D ) + | | w | | 2 2 L 1
regularization:
;
regularization:
;
where ; is the original cost function cost function without regularization for training
of a general parametric ML model.
Justify the following facts.
a regularization pushes all parameters towards small values but not necessarily exactly
zero
b tends to favor socalled "sparse" solutions, where only a few of the parameters are
nonzero, and the rest are exactly zero.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
