Consider the two-class classification problem where the class label y (0; 1) and each training...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Consider the two-class classification problem where the class label y € (0; 1) and each training example X has 2 binary attributes X₁, X₂ € (0; 1). Let the class prior be P(Y = 1) = 0.5 and let P(X₂ = 1 Y = 1) = 0.8. P(X₂ = 1 | Y= 1) = 0.5, So, attribute X₁ provides slightly stronger evidence about the class label than X₂ Assume X, and X₂ are truly independent given Y. given X₁ = x, and X₂ = x₂, write down the Naive Bayes decision rule. Fill out the following table of predictions, f(X₁, X₂) € (0.1), based on the Naive Bayes decision rule for each of the 4 settings for X₁, X₂. Please show your calculations. b. X₁ X₂ f(X₁X₂) 0 0 1 1 (c) OHOA 0 1 0 P(X₁ = 01Y = 0) = 0.7, P(X₂=0|1= 0) = 0.9. 1 For the Naive Bayes decision function f(X₁, X₂), the error rate is: I(YfX₁, X₂))P(X₁, X₂,Y) where I(Y = f(X₁X₂)) = 1ify = f(X₁, X₂) and 0 otherwise. For this question, we will assume that the true data distribution is the same as the Naive Bayes distribution, so P(X₁, X₂Y) we can be written as P(Y)P(X₁|Y)P(X₂ | Y). Show the error rate of the Naive Bayes Classifier is 0.235. c. Now, suppose that we create a new attribute X3, which is an exact copy of X₂. So, for every training example, attributes X₂and X3 have the same value, X₂ = X3. (a) What is the error rate of Naive Bayes now, using X₁, X₂, and X3? The predicted y should be computed using the assumption of conditional independence, and the error rate should be computed using the true probabilities. (b) Why does Naive Bayes perform worse with the addition of X3? Hint: Does the key assumption of Naive Bayes still hold? Now consider a logistic regression model M, with weight vector w = [w₁, W₂] that is used to predict Y using X, and X₂ and another model M', with weight vector w' = [w, w, w] that is used to predict Y using X₁, X2, and X3. What is the relation between w and w' after training both models? Will the trained logistic regression model M' suffer from the same problem as the Naive Bayes model in part (a)? Explain why or why not. Consider the two-class classification problem where the class label y € (0; 1) and each training example X has 2 binary attributes X₁, X₂ € (0; 1). Let the class prior be P(Y = 1) = 0.5 and let P(X₂ = 1 Y = 1) = 0.8. P(X₂ = 1 | Y= 1) = 0.5, So, attribute X₁ provides slightly stronger evidence about the class label than X₂ Assume X, and X₂ are truly independent given Y. given X₁ = x, and X₂ = x₂, write down the Naive Bayes decision rule. Fill out the following table of predictions, f(X₁, X₂) € (0.1), based on the Naive Bayes decision rule for each of the 4 settings for X₁, X₂. Please show your calculations. b. X₁ X₂ f(X₁X₂) 0 0 1 1 (c) OHOA 0 1 0 P(X₁ = 01Y = 0) = 0.7, P(X₂=0|1= 0) = 0.9. 1 For the Naive Bayes decision function f(X₁, X₂), the error rate is: I(YfX₁, X₂))P(X₁, X₂,Y) where I(Y = f(X₁X₂)) = 1ify = f(X₁, X₂) and 0 otherwise. For this question, we will assume that the true data distribution is the same as the Naive Bayes distribution, so P(X₁, X₂Y) we can be written as P(Y)P(X₁|Y)P(X₂ | Y). Show the error rate of the Naive Bayes Classifier is 0.235. c. Now, suppose that we create a new attribute X3, which is an exact copy of X₂. So, for every training example, attributes X₂and X3 have the same value, X₂ = X3. (a) What is the error rate of Naive Bayes now, using X₁, X₂, and X3? The predicted y should be computed using the assumption of conditional independence, and the error rate should be computed using the true probabilities. (b) Why does Naive Bayes perform worse with the addition of X3? Hint: Does the key assumption of Naive Bayes still hold? Now consider a logistic regression model M, with weight vector w = [w₁, W₂] that is used to predict Y using X, and X₂ and another model M', with weight vector w' = [w, w, w] that is used to predict Y using X₁, X2, and X3. What is the relation between w and w' after training both models? Will the trained logistic regression model M' suffer from the same problem as the Naive Bayes model in part (a)? Explain why or why not.
Expert Answer:
Answer rating: 100% (QA)
a The error rate of the Naive Bayes Classifier using X1 X2 is Error rate PY fX1 X2 PY 0 fX1 X2 1 PY ... View the full answer
Related Book For
Probability and Stochastic Processes A Friendly Introduction for Electrical and Computer Engineers
ISBN: 978-1118324561
3rd edition
Authors: Roy D. Yates, David J. Goodman
Posted Date:
Students also viewed these computer engineering questions
-
Fill out the following table for substance water
-
Fill out the following table for substance ammonia
-
Fill out the following table for substance ammonia:
-
The current Occupational Safety and Health Administration (OSHA) ceiling limit for H 2 S is up to 20 ppm for general industry. If the H 2 S meter reads 35 mg/m 3 , is this level above or below the...
-
A larger proton gradient is required to form a single ATP in chloroplasts than in mitochondria. Suggest a reason why. Hint: Ions can move across the thylakoid mem-brane more easily than across the...
-
An escalator raises a 100 kg bucket of sand 10 m in 1 minute. Determine the rate of work done during the process.
-
This creative dynamic visualization by Nathan Yau (flowingdata.com) shows a day in the life of 1000 different representative Americans based on survey data: http://flowingdata...
-
Bill Watts has decided to buy a sub shop franchise called Super Subs. He lives in Cedar Falls, Iowa, and will be the first Super Subs franchisee in the state. Along with buying a Super Subs...
-
We use checksum in both TCP and UDP to detect errors. Assume you have the following 2 bytes: 11011100 and 01100111. What would be the 1s complement of the addition of the above two bytes? Make sure...
-
Contrast the Islamic family with the socialist-law family.
-
ABC Company (ABC) is a company that manufactures bill boards which is based in Ondangwa. The owner of ABC has little accounting background but due to budget constraints they are not able to hire a...
-
Generate random excel scenarios using the specified excel functions. NO INFORMATION IS MISSING, THE DATASETS CAN BE RANDOM. *****EACH SCENARIO CAN HAVE RANDOM DATASETS, IT DOES NOT HAVE TO BE THE...
-
Introduction Match Four is a two-player connection board game, consisting in two players taking turns to drop colored discs into a vertically suspended grid. The pieces fall straight down, occupying...
-
I am learning to program a game engine which is why I followed a tutorial, with that tutorial I have gotten this far and even though my code is identical to theirs (theirs did work in the videos) its...
-
Using the provided CSU Widget Factory OSHA 3 0 0 A log , calculate the total recordable incidence rate ( TRIR ) ; the days away, restricted, or transferred ( DART ) rate; the lost workday injury and...
-
Need help with these questions. Functions defined outside such as BigInteger class, StringBuilder class, methods from Math class (used as Math.(), such as Math.max(), Math.min(), Math.abs() ... ) or...
-
An adjustment to the required return on common stock equity to reflect underpricing and flotation costs is necessary to find the cost of new issues of common stock. O a. False O b. True
-
Q1) What is the a3 Value Q2) What is the a7 Value Q3) What is the a4 Value Q4) What is the b3 Value Q5) What is the b2 Value Q6) What is the sign of 2nd constraint? A pastry chef at a bakery wants to...
-
An n-dimensional Gaussian vector W has a block diagonal covariance matrix where C X is m à m, C Y is (n m) à (n m). Show that W can be written in terms of component vectors X and Y in...
-
In Problem 11.4.5, we used simulation to estimate the probability of symbol error. For transmitting a binary bit stream over an MPSK system, we set each M = 2N and each transmitted symbol corresponds...
-
Random variable X and Y have joint PDF (a) Find the marginal PDFs fx(x) and fy(y). (b) What are E[x] and Var [X]? (c) What are E[Y] and Var [Y]? 0Sy 2, otherwise
-
Christian Dior S.A. is a successful family business with its famous fashion and perfume brands. Its latest effort in growing the business involves opening a series of international outlets to...
-
A group of retired engineering professors from Singapore Polytechnic has decided to form a small manufacturing firm that will produce a full line of automation equipment. The investors have proposed...
-
Footwear, Inc. manufactures a complete line of mens and womens formal shoes for independent merchants. The average selling price of its finished product is \($85\) per pair. The variable cost for...
Study smarter with the SolutionInn App