Question: Suppose we have a random dataset, i.e. the attribute values are generated independently of the class label, and it contains data points that belong to either the POSITIVE or the NEGATIVE class. We need to build a classifier for this dataset, using half of the data for training and the remaining half for testing. Please answer the following questions and provide a brief explanation for each answer:

(a) Suppose there are an equal number of positive and negative records in the data and the decision tree classifier predicts every test record to be positive. What is the expected error rate of the classifier on the test data?

(b) Repeat the previous analysis assuming that the classifier predicts each test record to be in the positive class with probability 0.8 and in the negative class with probability 0.2.

(c) Suppose two-thirds of the data belong to the positive class and the remaining one-third belong to the negative class. What is the expected error rate of a classifier that predicts every test record to be positive?

(d) Repeat the previous analysis assuming that the classifier predicts each test record to be in the positive class with probability 2/3 and in the negative class with probability 1/3.
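
One way to work through parts (a) to (d) is sketched below. Because the attributes carry no information about the class, the prediction for a test record is independent of its true label, so the expected error is P(predict positive) × P(true negative) + P(predict negative) × P(true positive). The sketch assumes the test half has, in expectation, the same class proportions as the full dataset; the function name `expected_error` is a hypothetical helper, not part of the original question.

```python
from fractions import Fraction


def expected_error(p_pos_data, p_pos_pred):
    """Expected error rate when predictions are independent of the true label.

    p_pos_data: fraction of positive records in the test data (assumed equal
                to the fraction in the full dataset).
    p_pos_pred: probability that the classifier predicts positive.
    """
    p_neg_data = 1 - p_pos_data
    p_neg_pred = 1 - p_pos_pred
    # An error is a positive prediction on a negative record,
    # or a negative prediction on a positive record.
    return p_pos_pred * p_neg_data + p_neg_pred * p_pos_data


results = {
    "a": expected_error(Fraction(1, 2), Fraction(1)),     # always predict positive
    "b": expected_error(Fraction(1, 2), Fraction(4, 5)),  # positive w.p. 0.8
    "c": expected_error(Fraction(2, 3), Fraction(1)),     # always predict positive
    "d": expected_error(Fraction(2, 3), Fraction(2, 3)),  # positive w.p. 2/3
}

for part, err in results.items():
    print(f"({part}) expected error = {err} = {float(err):.4f}")
```

Under these assumptions the sketch gives 1/2 for (a) and (b), 1/3 for (c), and 4/9 for (d). With balanced classes, any label-independent guessing strategy has a 50% error rate. With a 2:1 class ratio, always predicting the majority class (error 1/3) does better than guessing in proportion to the class frequencies (error 4/9).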
