6 Risk Minimization with Doubt
Suppose we have a classification problem with classes labeled $1, \dots, c$ and an additional "doubt" category labeled $c+1$. Let $f : \mathbb{R}^d \to \{1, \dots, c+1\}$ be a decision rule. Define the loss function

$$L(f(x), y) = \begin{cases} 0 & \text{if } f(x) = y \text{ and } f(x) \in \{1, \dots, c\}, \\ \lambda_c & \text{if } f(x) \neq y \text{ and } f(x) \in \{1, \dots, c\}, \\ \lambda_d & \text{if } f(x) = c + 1, \end{cases}$$
where $\lambda_c \geq 0$ is the loss incurred for making a misclassification and $\lambda_d \geq 0$ is the loss incurred for choosing doubt. In words, this means the following:
- When you are correct, you incur no loss.
- When you are incorrect, you incur a penalty $\lambda_c$ for making the wrong choice.
- When you are unsure about what to choose, you may select the "doubt" category and incur a penalty $\lambda_d$.
In lecture, you saw risk defined as an expectation over data points. We can also define the risk of classifying a new individual data point $x$ as class $f(x) \in \{1, 2, \dots, c+1\}$:

$$R(f(x) \mid x) = \sum_{i=1}^{c} L(f(x), i)\, P(Y = i \mid x).$$
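To make this definition concrete, here is a minimal sketch that evaluates the conditional risk of every possible action on a toy example; the posterior values and the penalties $\lambda_c = 1$ and $\lambda_d = 0.3$ are assumptions chosen purely for illustration.

```python
# Sketch: evaluate the conditional risk R(f(x) = k | x) of each action k
# directly from the loss definition above. All numbers are illustrative.

posterior = [0.6, 0.3, 0.1]  # assumed P(Y = i | x) for classes 1..c (here c = 3)
lam_c = 1.0                  # misclassification penalty (assumed)
lam_d = 0.3                  # doubt penalty (assumed)
c = len(posterior)

def conditional_risk(k):
    """Risk of predicting class k in 1..c, or doubt when k = c + 1."""
    if k == c + 1:                     # doubt: pay lam_d regardless of Y
        return lam_d * sum(posterior)  # = lam_d, since the posterior sums to 1
    # non-doubt: pay lam_c exactly when the true class differs from k
    return lam_c * sum(p for i, p in enumerate(posterior, start=1) if i != k)

for k in range(1, c + 2):
    label = "doubt" if k == c + 1 else f"class {k}"
    print(f"R(f(x) = {label} | x) = {conditional_risk(k):.2f}")
```

For this toy posterior the risks come out to 0.40, 0.70, 0.90, and 0.30, so doubt is the risk-minimizing action here, which agrees with the threshold rule derived in part (b): the largest posterior, 0.6, falls below $1 - \lambda_d/\lambda_c = 0.7$.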
(a) First, we will simplify the risk function using our specific loss function, treating separately the cases where $f(x)$ is or is not the doubt category.
i. Prove that $R(f(x) = i \mid x) = \lambda_c (1 - P(Y = i \mid x))$ when $i \neq c + 1$.
ii. Prove that $R(f(x) = c + 1 \mid x) = \lambda_d$.
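One way to see where these identities come from (a sketch, not a substitute for the requested proof): expand the sum in the risk definition. For (i), $L(i, i) = 0$ and $L(i, j) = \lambda_c$ for every $j \neq i$, so

$$R(f(x) = i \mid x) = \lambda_c \sum_{j \neq i} P(Y = j \mid x) = \lambda_c \bigl(1 - P(Y = i \mid x)\bigr),$$

using the fact that the posterior probabilities sum to one. For (ii), $L(c+1, j) = \lambda_d$ for every $j$, so the sum collapses to $\lambda_d \sum_{j=1}^{c} P(Y = j \mid x) = \lambda_d$.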
(b) Show that the following policy $f_{\mathrm{opt}}(x)$ obtains the minimum risk:
(R1) Find the non-doubt class $i$ such that $P(Y = i \mid x) \geq P(Y = j \mid x)$ for all $j$, meaning you pick the class with the highest posterior probability given $x$.
(R2) Choose class $i$ if $P(Y = i \mid x) \geq 1 - \lambda_d / \lambda_c$.
(R3) Choose doubt otherwise.
Hint: To prove that $f_{\mathrm{opt}}(x)$ minimizes risk, consider proof techniques that show $f_{\mathrm{opt}}(x)$ "stays ahead" of all other policies that don't follow these rules. For example, you could take a proof-by-contradiction approach: assume there exists some other policy, say $f'(x)$, that achieves lower risk than $f_{\mathrm{opt}}(x)$. In which scenarios might the predictions made by $f_{\mathrm{opt}}(x)$ and $f'(x)$ differ? In those scenarios, and based on the rules above that $f_{\mathrm{opt}}(x)$ follows, why would $f'(x)$ not be able to beat $f_{\mathrm{opt}}(x)$ in risk minimization?
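As a numerical sanity check on the claim (an illustration, not a proof), here is a minimal sketch that implements the policy (R1)-(R3) and compares it against a brute-force search over all actions on random posteriors; the penalty values are assumptions for illustration.

```python
import random

# Sketch: check on random posteriors that the policy (R1)-(R3) matches the
# action found by brute-force minimization of the conditional risk.
# The penalty values below are illustrative assumptions.

lam_c, lam_d = 1.0, 0.3

def risks(posterior):
    """Conditional risks of all actions: classes 1..c, then doubt last."""
    rs = [lam_c * (1.0 - p) for p in posterior]  # part (a)i: lam_c * (1 - P(Y=i|x))
    rs.append(lam_d)                             # part (a)ii: doubt always costs lam_d
    return rs

def f_opt(posterior):
    """The policy (R1)-(R3); returns a class in 1..c, or c + 1 for doubt."""
    c = len(posterior)
    i = max(range(c), key=lambda j: posterior[j])  # (R1): most probable class
    if posterior[i] >= 1.0 - lam_d / lam_c:        # (R2): confident enough?
        return i + 1
    return c + 1                                   # (R3): choose doubt

random.seed(0)
for _ in range(10_000):
    raw = [random.random() for _ in range(4)]
    post = [x / sum(raw) for x in raw]             # random 4-class posterior
    rvec = risks(post)
    chosen = f_opt(post) - 1                       # 0-based index of the action
    best = min(range(len(rvec)), key=rvec.__getitem__)
    # Ties are possible, so compare risks rather than indices.
    assert abs(rvec[chosen] - rvec[best]) < 1e-12
print("f_opt matched the brute-force minimum risk on all 10,000 trials")
```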
(c) How would you modify your optimal decision rule if $\lambda_d = 0$? What happens if $\lambda_d > \lambda_c$? Explain why this is or is not consistent with what one would expect intuitively.
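As a pointer toward the intuition (the specific numbers are illustrative, not part of the problem), consider how the threshold in (R2) behaves at these extremes: when $\lambda_d = 0$ the threshold is $1 - 0/\lambda_c = 1$, while $\lambda_d > \lambda_c$ makes it negative; for instance, $\lambda_c = 1$ and $\lambda_d = 2$ give a threshold of $1 - 2/1 = -1$, which any posterior probability $P(Y = i \mid x) \geq 0$ trivially satisfies.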