Question: [AI_107_A_2] 2. (3 pts each, 18 pts total)

Consider a binary-class classification problem in which the data set contains equal numbers of positive and negative examples. Also suppose that the class labels (i.e., ±1) are assigned to these examples completely at random. For the following two classification models, estimate the test accuracy under each of the given estimation methods.

(i) A perfect decision tree model trained from the training data without pruning.

(a) Repeated random subsampling, with 2/3 of the data used for training and the rest for testing.

Test Accuracy = ?

(b) Stratified 10-fold CV.

Test Accuracy = ?

(c) LOOCV.

Test Accuracy = ?
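One way to sanity-check part (i) is a small simulation. The sketch below is an illustration only, not the graded answer. It assumes one random real-valued feature per example (the question specifies no features), and it uses a hypothetical nearest-training-point memorizer as a stand-in for an unpruned decision tree, since both reproduce the training labels exactly. All function names are made up for this example.

```python
import random

def make_data(n, rng):
    """n examples: one random feature each, balanced labels in random order."""
    labels = [+1] * (n // 2) + [-1] * (n // 2)
    rng.shuffle(labels)
    return [(rng.random(), y) for y in labels]

def predict_memorizer(train, x):
    """ASSUMPTION: stand-in for an unpruned tree. Returns the label of the
    closest training point, so training accuracy is 100%."""
    return min(train, key=lambda p: abs(p[0] - x))[1]

def accuracy(train, test):
    hits = sum(predict_memorizer(train, x) == y for x, y in test)
    return hits / len(test)

def random_subsampling(data, rng, repeats=20):
    """Average accuracy over repeated 2/3 train, 1/3 test splits."""
    accs = []
    for _ in range(repeats):
        shuffled = data[:]
        rng.shuffle(shuffled)
        cut = 2 * len(shuffled) // 3
        accs.append(accuracy(shuffled[:cut], shuffled[cut:]))
    return sum(accs) / len(accs)

def stratified_kfold(data, rng, k=10):
    """k folds, each holding the same number of positives and negatives."""
    pos = [p for p in data if p[1] == +1]
    neg = [p for p in data if p[1] == -1]
    rng.shuffle(pos)
    rng.shuffle(neg)
    folds = [pos[i::k] + neg[i::k] for i in range(k)]
    accs = []
    for i in range(k):
        train = [p for j, fold in enumerate(folds) if j != i for p in fold]
        accs.append(accuracy(train, folds[i]))
    return sum(accs) / k

def loocv(data):
    """Hold out one example at a time and train on all the others."""
    hits = 0
    for i, (x, y) in enumerate(data):
        train = data[:i] + data[i + 1:]
        hits += predict_memorizer(train, x) == y
    return hits / len(data)

rng = random.Random(0)
data = make_data(300, rng)
tree_results = {
    "random_subsampling": random_subsampling(data, rng),
    "stratified_10_fold": stratified_kfold(data, rng),
    "loocv": loocv(data),
}
print(tree_results)
```

Because the labels are assigned at random, whatever the model memorizes from the training set carries no information about unseen examples, so under these assumptions all three estimates should land close to 50%.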

(ii) A majority predictor model that always predicts the majority class of its training data.

(a) Repeated random subsampling, with 2/3 of the data used for training and the rest for testing.

Test Accuracy = ?

(b) Stratified 10-fold CV.

Test Accuracy = ?

(c) LOOCV.

Test Accuracy = ?
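The majority predictor in part (ii) ignores features entirely, so its behavior can be checked with labels alone. The following is a minimal sketch under stated assumptions: 300 examples, labels encoded as +1 and -1, and ties broken in favor of +1 (the question does not say how ties are resolved). The function names are hypothetical.

```python
import random

def majority_label(train_labels):
    """Most frequent training label. ASSUMPTION: ties go to +1."""
    return +1 if sum(train_labels) >= 0 else -1

def accuracy(train_labels, test_labels):
    guess = majority_label(train_labels)
    return sum(y == guess for y in test_labels) / len(test_labels)

def random_subsampling(labels, rng, repeats=50):
    """Average accuracy over repeated 2/3 train, 1/3 test splits."""
    accs = []
    for _ in range(repeats):
        shuffled = labels[:]
        rng.shuffle(shuffled)
        cut = 2 * len(shuffled) // 3
        accs.append(accuracy(shuffled[:cut], shuffled[cut:]))
    return sum(accs) / len(accs)

def stratified_kfold(labels, k=10):
    """k folds, each holding the same number of positives and negatives."""
    pos = [y for y in labels if y == +1]
    neg = [y for y in labels if y == -1]
    folds = [pos[i::k] + neg[i::k] for i in range(k)]
    accs = []
    for i in range(k):
        train = [y for j, fold in enumerate(folds) if j != i for y in fold]
        accs.append(accuracy(train, folds[i]))
    return sum(accs) / k

def loocv(labels):
    """Hold out one label at a time and train on all the others."""
    hits = 0
    for i, y in enumerate(labels):
        train = labels[:i] + labels[i + 1:]
        hits += majority_label(train) == y
    return hits / len(labels)

rng = random.Random(0)
labels = [+1] * 150 + [-1] * 150
rng.shuffle(labels)
majority_results = {
    "random_subsampling": random_subsampling(labels, rng),
    "stratified_10_fold": stratified_kfold(labels),
    "loocv": loocv(labels),
}
print(majority_results)
```

With perfectly balanced data, holding out one example always leaves its class in the minority of the training set, so LOOCV scores 0.0 here. Stratified folds keep the training set tied and every test fold balanced, which gives exactly 0.5 whichever way the tie is broken. Random subsampling cannot exceed 0.5, because any surplus of one class in the training split becomes a deficit of that class in the test split.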


