1. Consider the training data set below. Your goal is to build a classifier to predict...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
1. Consider the training data set below. Your goal is to build a classifier to predict the last column "Beach" using the input attributes: "Thunder", "Hailstorm", "Homework" and "Tsunami". More specifically, you will compare Naive Bayes (NB) and Decision Tree (DT). For Naive Bayes (NB), you will use m-estimate from the lecture with m = 0.001 and p = 0.5 for probability estimations. For example, based on the training dataset below, the original probability estimation for P(Beach = Yes) = 9/11 while its corresponding m-estimate is: P(Beach = Yes) = (9+m*p)/(14+m) = (9+0.001*0.5)/(14+0.001). For Decision Tree (DT), you will follow the lecture to build your trees, where multiple-way splitting is allowed and Information Gain (IG) is employed to select the best attribute. In the case of ties, break ties in favor of the leftmost feature. Homework Tsunami Day Thunder Hailstorm. Beach E Overcast Cool High Wonk Yes 2 Sunny Hot High Strong No 3 Sunny Hot High Strong Yes 4 Overcast Cool High Weak Ye 5 Sunny Hot Normal Strong No G Sunny Cool High Strong 7 Overcast Cool Normal Weak Yes 8 Sunny Hot Normal Strong No 9 Overcast Hot Normal Weak Yes 10 Overcast Hot High Strong Yes 11 Overcast Cool Normal Strong No 12 Sunny Hot High Weak No 13 Overcast Hot High Strong Yes 14 Sunny Cool High Weak No (i) (10 points) Compare the performance of NB vs. DT using 2-fold cross-validation (CV) and report their 2-fold CV accuracy. For the ith fold, the testing dataset is composed of all the data points whose (Day mod 2 = 1-1). For each fold, show the induced Naive Bayes and DT models. (5 points) Based on the 2-fold CV accuracy from (a), which classifier, NB or DT, would you choose? Report your final model for the selected classifier. 1. Consider the training data set below. Your goal is to build a classifier to predict the last column "Beach" using the input attributes: "Thunder", "Hailstorm", "Homework" and "Tsunami". More specifically, you will compare Naive Bayes (NB) and Decision Tree (DT). For Naive Bayes (NB), you will use m-estimate from the lecture with m = 0.001 and p = 0.5 for probability estimations. For example, based on the training dataset below, the original probability estimation for P(Beach = Yes) = 9/11 while its corresponding m-estimate is: P(Beach = Yes) = (9+m*p)/(14+m) = (9+0.001*0.5)/(14+0.001). For Decision Tree (DT), you will follow the lecture to build your trees, where multiple-way splitting is allowed and Information Gain (IG) is employed to select the best attribute. In the case of ties, break ties in favor of the leftmost feature. Homework Tsunami Day Thunder Hailstorm. Beach E Overcast Cool High Wonk Yes 2 Sunny Hot High Strong No 3 Sunny Hot High Strong Yes 4 Overcast Cool High Weak Ye 5 Sunny Hot Normal Strong No G Sunny Cool High Strong 7 Overcast Cool Normal Weak Yes 8 Sunny Hot Normal Strong No 9 Overcast Hot Normal Weak Yes 10 Overcast Hot High Strong Yes 11 Overcast Cool Normal Strong No 12 Sunny Hot High Weak No 13 Overcast Hot High Strong Yes 14 Sunny Cool High Weak No (i) (10 points) Compare the performance of NB vs. DT using 2-fold cross-validation (CV) and report their 2-fold CV accuracy. For the ith fold, the testing dataset is composed of all the data points whose (Day mod 2 = 1-1). For each fold, show the induced Naive Bayes and DT models. (5 points) Based on the 2-fold CV accuracy from (a), which classifier, NB or DT, would you choose? Report your final model for the selected classifier.
Expert Answer:
Answer rating: 100% (QA)
package testjavafx import javafxapplication Application import javafxevent ActionEvent import javafxevent EventHandler import javafxsceneScene import javafxscenecontrolButton import javafxscenelayoutStackPane import javafx ... View the full answer
Related Book For
Business Statistics in Practice Using Data Modeling and Analytics
ISBN: 978-1259549465
8th edition
Authors: Bruce L Bowerman, Richard T O'Connell, Emilly S. Murphree
Posted Date:
Students also viewed these accounting questions
-
Your goal is to synthesize 4, 4-dimethyl-2-pentyne. You have a choice of beginning with any of the following reagents: Assume that you also have available sodium amide and liquid ammonia. Outline the...
-
As newly appointed Energy Czar, your goal is to reduce the total demand for residential heating fuel in your state. You must choose one of three legislative proposals designed to accomplish this...
-
As newly appointed "Energy Czar," your goal is to reduce the total demand for residential heating fuel in your state. You must choose one of three legislative proposals designed to accomplish this...
-
A 1220-N uniform beam is attached to a vertical wall at one end and is supported by a cable at the other end. A 1960-N crate hangs from the far end of the beam. Using the data shown in the drawing,...
-
Workplace Products Inc., a wholesaler of office products, was organized on February 1 of the current year, with an authorization of 10,000 shares of preferred 2% stock, $120 par and 250,000 shares of...
-
The definite integral can be used to find the area between two curves. In each case, a. Sketch the graphs of the two functions over the given range, and shade the area between the curves. b. Sketch...
-
Test H0: 1 = 0 versus H1: 1 0 at the = 0.05 level. Repeat for 2 and 3. Exercises 1115 refer to the following data set: x1 *2 x3 69.8 7.9 37.3 62.4 32.3 9.3 20.2 40.7 66.9 13.3 30.5 48.7 87.5 27.4...
-
Superior Co. manufactures automobile parts for sale to the major U.S. automakers. Superior's internal audit staff is to review the internal controls over machinery and equipment and make...
-
1 . Diameter of 2 cm is heated to a temperature of 8 0 0 K and placed in a vacuum environment with a temperature of 5 0 K . If the emissivity of the object is 0 . 4 , calculate:A ) The rate of heat...
-
Sandy and John Ferguson got married eight years ago and have a seven-year-old daughter, Samantha. In 2022, Sandy worked as a computer technician at a local university, earning a salary of $152,000,...
-
Use the following bond quotation: Issuer Symbol Callable Coupon Maturity Rating Price Yield Walmart WMT.IM No 4.875 7/8/2040 Aa2 119.58 3.46 What is the yield to maturity?
-
Calculate the price elasticity of demand for gasoline implied by what most studies have found. Why the tepid response to higher gasoline prices? Most studies report that when U.S. gasoline prices...
-
Which of the following utility functions are consistent with weak complementarity? a. b. c. d. e. U(x,z,q) =YxIn(x-8(q) + 0)+In z, 4,0>0, 8(q)>0
-
Apple and Samsung have two pricing strategies: Set a high (monopoly) price or set a low (competitive) price. Suppose that if they both set a competitive price, economic profit for both is zero. If...
-
The Wake County database contains a variable lakedist, which measures the distance (in hundreds of meters) from the property to the nearest lake. Consider using this variable to proxy the value of...
-
Consider the substitutes model in which utility is U(H,z) and the production technology is H=f(x,q). Assume the baseline price of theprivate input x is p=$10 and that the baseline level of the...
-
What would be a great financial gift to get for a 1 year old? So, that over time it grow.
-
In a large midwestern university, 30% of the students live in apartments. If 200 students are randomly selected, find the probability that the number of them living in apartments will be between 55...
-
Given the payoff table, find the alternative that would be chosen using the maximin criterion. The example involves a capacity-planning problem in which a company must choose to build a small,...
-
When we perform the sign test, we use the sample data to compute a p-value. What probability distribution is used to compute the p-value? Explain why.
-
Table 2.12 gives the 2014 franchise value and revenues for each of the 30 teams in Major League Baseball as reported by Forbes magazine and as listed on the Forbes. com website on January 14, 2015....
-
The beam is made from a plate that has a constant thickness \(t\). Determine the width \(w\) as a function of \(x\) so that it has a maximum bending stress \(\sigma_{\max }\) throughout its length...
-
The beam has a constant thickness \(b\). If it supports the distributed loading shown, determine the maximum bending stress \(\sigma_{\max }\) in the beam. ho 22 -- Wo 2ho I ho 22
-
The beam is made from a plate having a constant thickness \(t\) and a width that varies as shown. If it supports a concentrated force \(\mathbf{P}\) at its center, determine the absolute maximum...
Study smarter with the SolutionInn App