Question:

[Figure: two decision trees — a) a decision tree with 8 errors, b) a decision tree with 11 errors]

Consider the decision trees shown above. Assume they are generated from a data set that contains 32 binary attributes and 3 classes, C1, C2, and C3. Compute the total description length of each decision tree according to the minimum description length (MDL) principle. The total description length of a tree is given by:

Cost(tree, data) = Cost(tree) + Cost(data|tree)

- Each internal node of the tree is encoded by the ID of the splitting attribute. If there are m attributes, the cost of encoding each attribute is log2(m) bits.
- Each leaf is encoded using the ID of the class it is associated with. If there are k classes, the cost of encoding a class is log2(k) bits.
- Cost(tree) is the cost of encoding all the nodes in the tree. To simplify the computation, you can assume that the total cost of the tree is obtained by adding up the costs of encoding each internal node and each leaf node.
- Cost(data|tree) is encoded using the classification errors the tree commits on the training set. Each error is encoded by log2(n) bits, where n is the total number of training instances.
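The cost formulas above translate directly into a short computation. The sketch below is a minimal illustration, not the official solution: the trees' node counts and the training-set size n are not reproduced in the text, so the values used in the usage example (internal/leaf counts and n = 16) are hypothetical placeholders you would replace with the actual numbers read off the figure.

```python
import math

def tree_description_length(num_internal, num_leaves, num_errors,
                            m=32, k=3, n=16):
    """Total MDL cost of a decision tree.

    m = number of attributes, k = number of classes,
    n = number of training instances (n=16 is a placeholder assumption).
    """
    # Cost(tree): each internal node costs log2(m) bits (attribute ID),
    # each leaf costs log2(k) bits (class ID).
    cost_tree = num_internal * math.log2(m) + num_leaves * math.log2(k)
    # Cost(data|tree): each classification error costs log2(n) bits.
    cost_data_given_tree = num_errors * math.log2(n)
    return cost_tree + cost_data_given_tree

# Hypothetical structures for the two trees (replace with the figure's counts):
# tree (a): 4 internal nodes, 5 leaves, 8 errors
# tree (b): 2 internal nodes, 3 leaves, 11 errors
cost_a = tree_description_length(4, 5, 8)
cost_b = tree_description_length(2, 3, 11)
```

Under MDL, the tree with the smaller total description length is preferred, so a tree with more errors can still win if its structure is much cheaper to encode.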
