Question: Machine Learning Exercise 3.6 [Cross-entropy error measure]
Machine Learning

Exercise 3.6 [Cross-entropy error measure]

(a) More generally, if we are learning from $\pm 1$ data to predict a noisy target $P(y \mid \mathbf{x})$ with candidate hypothesis $h$, show that the maximum likelihood method reduces to the task of finding $h$ that minimizes

$$E_{\text{in}}(\mathbf{w}) = \sum_{n=1}^{N} [y_n = +1] \ln\frac{1}{h(\mathbf{x}_n)} + [y_n = -1] \ln\frac{1}{1 - h(\mathbf{x}_n)}.$$

(b) For the case $h(\mathbf{x}) = \theta(\mathbf{w}^{\mathsf T}\mathbf{x})$, argue that minimizing the in-sample error in part (a) is equivalent to minimizing the one in (3.9).

For two probability distributions $\{p, 1-p\}$ and $\{q, 1-q\}$ with binary outcomes, the cross-entropy (from information theory) is

$$p \log\frac{1}{q} + (1 - p) \log\frac{1}{1 - q}.$$

The in-sample error in part (a) corresponds to a cross-entropy error measure on the data point $(\mathbf{x}_n, y_n)$, with $p = [y_n = +1]$ and $q = h(\mathbf{x}_n)$.

For reference, (3.9) is the logistic regression in-sample error

$$E_{\text{in}}(\mathbf{w}) = \frac{1}{N} \sum_{n=1}^{N} \ln\!\left(1 + e^{-y_n \mathbf{w}^{\mathsf T}\mathbf{x}_n}\right). \tag{3.9}$$
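A sketch of the reasoning (not the book's worked solution) may help orient the derivation, under the standard interpretation that $h(\mathbf{x}_n)$ plays the role of $P(y_n = +1 \mid \mathbf{x}_n)$. For part (a), the likelihood of the sample is

$$\prod_{n=1}^{N} P(y_n \mid \mathbf{x}_n) = \prod_{n=1}^{N} h(\mathbf{x}_n)^{[y_n = +1]} \bigl(1 - h(\mathbf{x}_n)\bigr)^{[y_n = -1]},$$

and maximizing it is the same as minimizing its negative logarithm, which is exactly the cross-entropy sum in part (a). For part (b), the logistic function $\theta(s) = e^{s}/(1 + e^{s})$ satisfies $1 - \theta(s) = \theta(-s)$ and $\ln\frac{1}{\theta(s)} = \ln(1 + e^{-s})$, so with $h(\mathbf{x}) = \theta(\mathbf{w}^{\mathsf T}\mathbf{x})$ each summand becomes

$$[y_n = +1] \ln\bigl(1 + e^{-\mathbf{w}^{\mathsf T}\mathbf{x}_n}\bigr) + [y_n = -1] \ln\bigl(1 + e^{\mathbf{w}^{\mathsf T}\mathbf{x}_n}\bigr) = \ln\bigl(1 + e^{-y_n \mathbf{w}^{\mathsf T}\mathbf{x}_n}\bigr),$$

so the sum in part (a) equals $N$ times the error in (3.9), and the constant factor $1/N$ does not change the minimizer.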
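As a quick sanity check of that equivalence, the following minimal Python sketch (the random data, seed, and variable names are illustrative assumptions, not from the book) evaluates both error expressions on the same weight vector and confirms they agree up to the $1/N$ factor.

    import numpy as np

    rng = np.random.default_rng(0)
    N, d = 100, 3
    X = rng.normal(size=(N, d))        # illustrative inputs
    w = rng.normal(size=d)             # arbitrary weight vector
    y = rng.choice([-1, 1], size=N)    # random +/-1 labels

    def theta(s):
        # logistic function theta(s) = e^s / (1 + e^s)
        return 1.0 / (1.0 + np.exp(-s))

    h = theta(X @ w)                   # h(x_n) = theta(w^T x_n)

    # Part (a): cross-entropy error with p = [y_n = +1], q = h(x_n)
    E_a = np.sum(np.where(y == 1, np.log(1.0 / h), np.log(1.0 / (1.0 - h))))

    # Equation (3.9): (1/N) * sum_n ln(1 + exp(-y_n w^T x_n))
    E_39 = np.mean(np.log(1.0 + np.exp(-y * (X @ w))))

    print(np.isclose(E_a, N * E_39))   # expected output: True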
