Problem 4: Bayes Classifiers

In order to reduce my email load, I decided to implement a machine learning algorithm to decide whether or not I should read an email, or simply file it away instead. To train my model, I obtain the following data set of binary-valued features about each email, including whether I know the author or not, whether the email is long or short, and whether it has any of several keywords, along with my final decision about whether to read it (y = +1 for 'read', y = -1 for 'discard').

Columns: x1 = know the author?, x2 = is long?, x3 = has 'research'?, x4 = has 'grade'?, x5 = has 'lottery'?, y = read?

Table entries in reading order (x1 x2 x3 x4 x5 y per row; some entries are missing from the source): 0 0 1 0 -1 1 1 0 0 -1 0 1 -1 1 1 0 -1 0 0 -1 0 +1 0 0 +1 1 0 0 0 0 +1 0 +1 -1

In the case of any ties, we will prefer to predict class +1. I decide to try a Bayes classifier to make my decisions and compute my uncertainty.

(a) Compute all the probabilities necessary for a naive Bayes classifier, i.e., the class probability p(y) and all the individual feature probabilities p(x_i | y), for each class y and feature x_i. Which class would be predicted for x = (0 0 0 0 0)? What about for x = (1 1 0 1 0)?

(b) Compute the posterior probability that y = +1 given the observation x = (1 1 0 1 0).

(c) Why should we probably not use a Bayes classifier (using the joint probability of the features x, as opposed to a naive Bayes classifier) for these data?
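The quantities asked for in parts (a) and (b) can be computed mechanically. The following is a minimal sketch, not the expert solution: it estimates p(y) and p(x_i = 1 | y) as plain empirical frequencies (no smoothing, which is an assumption), multiplies them under the naive Bayes independence assumption, and breaks ties in favor of class +1 as the problem specifies. The function names and the toy data are hypothetical and are not the data set from the problem; exact fractions are used so that ties are detected exactly.

```python
from fractions import Fraction


def fit_naive_bayes(X, y):
    """Estimate p(y) and p(x_i = 1 | y) as empirical frequencies (no smoothing)."""
    n = len(y)
    num_features = len(X[0])
    prior, cond = {}, {}
    for c in sorted(set(y)):
        rows = [x for x, label in zip(X, y) if label == c]
        prior[c] = Fraction(len(rows), n)
        cond[c] = [Fraction(sum(r[i] for r in rows), len(rows))
                   for i in range(num_features)]
    return prior, cond


def joint(x, c, prior, cond):
    """p(y = c) * prod_i p(x_i | y = c) under the naive Bayes assumption."""
    p = prior[c]
    for xi, p_one in zip(x, cond[c]):
        p *= p_one if xi == 1 else 1 - p_one
    return p


def predict(x, prior, cond):
    """Return +1 or -1; ties go to +1, as stated in the problem."""
    return +1 if joint(x, +1, prior, cond) >= joint(x, -1, prior, cond) else -1


def posterior_pos(x, prior, cond):
    """p(y = +1 | x), or None if both joint probabilities are zero."""
    s_pos = joint(x, +1, prior, cond)
    s_neg = joint(x, -1, prior, cond)
    total = s_pos + s_neg
    return s_pos / total if total != 0 else None


# Hypothetical toy data with two binary features (NOT the problem's table).
X_toy = [(1, 0), (1, 1), (0, 1), (0, 0)]
y_toy = [+1, +1, -1, -1]
prior, cond = fit_naive_bayes(X_toy, y_toy)

print(prior)                                # class probabilities p(y)
print(cond)                                 # p(x_i = 1 | y) for each class
print(predict((1, 0), prior, cond))         # predicted class
print(posterior_pos((1, 0), prior, cond))   # posterior p(y = +1 | x)
```

To apply this to the problem, X would hold the ten-or-so rows of (x1, ..., x5) and y the matching labels. Note that a naive Bayes model over five binary features needs only five conditional probabilities per class, whereas a joint Bayes classifier would need a probability for each of the 2^5 = 32 feature combinations per class, which is the contrast part (c) asks about.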
