Question: Evaluating Hypotheses and Decision Tree

a) We trained two classifiers: a Decision Tree, trained with 350 examples and tested with 30 examples, and a Support Vector Machine, trained with 500 examples and tested with 50 examples. The correctly classified percentage is 89% for the Decision Tree and 92% for the Support Vector Machine. Calculate the confidence intervals of the two true errors and comment on how they relate to the statistical difference between the results. Use z = 1.96 for a confidence level of 95%. [6 marks]

b) Use tossing a coin as an example to justify how the expected difference between the sample error and the true error depends on the size of the data sample. [10 marks]

c) Suppose we implement a Decision Tree algorithm with 10-fold cross-validation for a classification problem, and the confusion matrix is shown in Table 4. State the total number of instances, the number of correctly classified instances, and the recall and precision rates for class C.

Table 4: Confusion Matrix (rows = true class, columns = predicted class)

              Predicted
            A    B    C
   True A  50   30   10
        B  20   30   10
        C   0   10   50

[4 marks]

d) Considering the example in Table 5, what is the maximal number of leaf (decision) nodes that we can have in a decision tree for this data? What is the maximal possible depth of a decision tree for this data? Draw the tree and explain which attribute should be the root node and why.

Table 5: Training Set Example

   Instance  Classification  Attribute 1  Attribute 2
      1           C1              Y            Y
      2           C1              Y            Y
      3           C2              Y            N
      4           C1              N            N
      5           C2              N            Y
      6           C2              N            Y

[10 marks]

e) Using the example in Table 5, prove that the decrease in entropy impurity provided by a single yes/no query can never be greater than one bit. [4 marks]
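For part (a), a minimal Python sketch of the usual normal-approximation interval, error ± z * sqrt(error * (1 - error) / n), where n is the test-set size; the function name is mine, not part of the question:

import math

def error_confidence_interval(accuracy, n, z=1.96):
    # Confidence interval for the true error under the normal approximation:
    # error +/- z * sqrt(error * (1 - error) / n)
    error = 1.0 - accuracy
    margin = z * math.sqrt(error * (1.0 - error) / n)
    return error - margin, error + margin

# n is the number of TEST examples, not training examples.
print(error_confidence_interval(0.89, 30))  # Decision Tree: about (-0.002, 0.222)
print(error_confidence_interval(0.92, 50))  # SVM: about (0.005, 0.155)

Whether these two intervals overlap is what the question asks you to comment on when judging statistical difference.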
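For part (b), a short simulation (my own illustration, not required by the question) shows the coin-tossing intuition: the sample estimate of a fair coin's heads rate (true rate 0.5) deviates from the true value less, on average, as the number of tosses grows.

import random

random.seed(0)  # reproducible runs

def sample_heads_rate(n):
    # Fraction of heads observed in n tosses of a fair coin.
    return sum(random.random() < 0.5 for _ in range(n)) / n

for n in (10, 100, 1000, 10000):
    deviations = [abs(sample_heads_rate(n) - 0.5) for _ in range(200)]
    print(n, sum(deviations) / len(deviations))  # shrinks roughly as 1/sqrt(n)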
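For part (c), the requested quantities can be read straight off Table 4 as reconstructed above (rows = true class, columns = predicted class); this sketch just automates the arithmetic:

matrix = {
    "A": {"A": 50, "B": 30, "C": 10},
    "B": {"A": 20, "B": 30, "C": 10},
    "C": {"A": 0,  "B": 10, "C": 50},
}

total = sum(sum(row.values()) for row in matrix.values())    # all instances
correct = sum(matrix[c][c] for c in matrix)                  # diagonal entries
recall_C = matrix["C"]["C"] / sum(matrix["C"].values())      # TP / (TP + FN)
precision_C = matrix["C"]["C"] / sum(row["C"] for row in matrix.values())  # TP / (TP + FP)

print(total, correct, recall_C, precision_C)  # 210 130 0.833... 0.714...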
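For parts (d) and (e), both the root-node choice and the one-bit bound come down to information gain: the decrease in entropy impurity produced by a Y/N split. A sketch over the Table 5 data (function names are mine); since a yes/no question itself carries at most one bit of information, the printed gain can never exceed 1:

import math

def entropy(labels):
    # Shannon entropy of the class distribution, in bits.
    n = len(labels)
    return -sum(p * math.log2(p)
                for p in (labels.count(c) / n for c in set(labels)))

def information_gain(labels, attribute):
    # Entropy decrease from splitting on a binary Y/N attribute.
    n = len(labels)
    gain = entropy(labels)
    for value in set(attribute):
        subset = [lab for lab, a in zip(labels, attribute) if a == value]
        gain -= len(subset) / n * entropy(subset)
    return gain

# Table 5, instances 1-6
labels = ["C1", "C1", "C2", "C1", "C2", "C2"]
attr1  = ["Y",  "Y",  "Y",  "N",  "N",  "N"]
attr2  = ["Y",  "Y",  "N",  "N",  "Y",  "Y"]

print(information_gain(labels, attr1))  # about 0.082 bits
print(information_gain(labels, attr2))  # about 0 bits

The attribute with the larger gain is the natural candidate for the root node in part (d).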
