(2) The following table consists of training data from an employee database. (2) The following table consists...
Fantastic news! We've Found the answer you've been seeking!
Question:
(2) The following table consists of training data from an employee database.
Transcribed Image Text:
(2) The following table consists of training data from an employee database. The data have been generalized. For example, "31... 35" for age represents the age range of 31 to 35. For a given row entry, count represents the number of data tuples having the values for department, status, age, and salary given in that row. salary department status sales sales sales . age 31... 35 senior junior 26. 30 35 junior 31. junior 21. 25 systems 66K... 70K systems senior 31... 35 systems junior 26. 30 46K... 50K systems senior 41. 45 66K... 70K 46K... 50K 10 41K... 45K 4 marketing senior 36. 40 marketing junior 31. 35 secretary senior 46... 50 secretary junior 26... 30 4 36K... 40K 26K... 30K 6 count 30 40 40 46K... 50K 26K... 30K 31K... 35K 46K... 50K 20 5 3 3 Let status be the class label attribute. i. [5 points] How would you modify the basic decision tree algorithm to take into consideration the count of each generalized data tuple (i.e., of each row entry)? ii. [10 points] Use your algorithm to construct a decision tree from the given data. iii. [5 points] Given a data tuple having the values "systems", "26... 30", and "46-50K" for the attributes department, age, and salary, respectively, what would a naive Bayesian classification of the status for the tuple be? (2) The following table consists of training data from an employee database. The data have been generalized. For example, "31... 35" for age represents the age range of 31 to 35. For a given row entry, count represents the number of data tuples having the values for department, status, age, and salary given in that row. salary department status sales sales sales . age 31... 35 senior junior 26. 30 35 junior 31. junior 21. 25 systems 66K... 70K systems senior 31... 35 systems junior 26. 30 46K... 50K systems senior 41. 45 66K... 70K 46K... 50K 10 41K... 45K 4 marketing senior 36. 40 marketing junior 31. 35 secretary senior 46... 50 secretary junior 26... 30 4 36K... 40K 26K... 30K 6 count 30 40 40 46K... 50K 26K... 30K 31K... 35K 46K... 50K 20 5 3 3 Let status be the class label attribute. i. [5 points] How would you modify the basic decision tree algorithm to take into consideration the count of each generalized data tuple (i.e., of each row entry)? ii. [10 points] Use your algorithm to construct a decision tree from the given data. iii. [5 points] Given a data tuple having the values "systems", "26... 30", and "46-50K" for the attributes department, age, and salary, respectively, what would a naive Bayesian classification of the status for the tuple be?
Expert Answer:
Answer rating: 100% (QA)
Run info Assessor wekaattributeSelectionInfoGainAttributeEval Search wekaattributeSelectionRanker T ... View the full answer
Related Book For
Posted Date:
Students also viewed these algorithms questions
-
Identify ethical dilemmas that are most challenging for leaders and managers. Q. Describe specific ethical dilemmas that exist within organizations. Q. Identify and recommend ethical solutions to...
-
The following table consists of training data from an employee database. The following table consists of training data from an employee database. The data have been generalized. For example, "31......
-
Table 2.6.4 is an excerpt from a salespersons database of customers. a. What is an elementary unit for this data set? b. What kind of data set is this: univariate, bivariate, or multivariate? c....
-
characterize the duplicate constructor utilized in c++ alongside its overall capacity model explaon the different situations which it is called what is the distinction between CSMA/CD/CSMA/CA what...
-
The information on the following page was obtained from the records of Breanna. Inc.: Account receivable............................................ $ 40,000 Accumulated...
-
Using various employment websites (i.e. Monster.com, Indeed.com, USAjobs.gov) find three (3) careers in finance that you are interested in applying to. Be sure to specifically address why you are...
-
The enzyme lipase catalyzes the hydrolysis of esters of fatty acids. The hydrolysis of p-nitrophenyloctanoate was followed by measuring the appearance of p-nitrophenol in the reaction mixture: The...
-
Waldum Company purchased packaging equipment on January 5, 2012, for $135,000. The equipment was expected to have a useful life of three years, or 18,000 operating hours, and a residual value of...
-
__________ are expenses that can be subtracted from total income in order to calculate tax liability. They may include child care expenses and union dues. deductions debentures tax credits dividends...
-
The Golden Oranges Nursery, which provides facilities for pre-school children on a commercial basis, is preparing its cash budget for next year. A profile of the estimated revenues and expenses for...
-
Describe an ethics checklist a sale manager can follow to determine if he/she is behaving ethically in sales situations/
-
Distinguish between current and accumulated E&P.
-
The following case highlights the right of the taxpayer to select among legitimate business alternatives in order to avoid taxes. Read the case and prepare a written brief.
-
What is the distinction between deductions for adjusted gross income and deductions from adjusted gross income?
-
When are capital expenditures incurred for medical reasons deductible?
-
Can a taxpayer qualify for the household and dependent care credit if he or she is not employed?
-
Correct the false statements. Why are they false? 1.) People with a high need for orientation tend to be resistant to the media's political priorities. 2.) Cultivation theory predicts that the...
-
a. Why does the Wi-Fi Alliance release compatibility testing profiles in waves instead of combining the entire standards features initially? 27a1.) An 802.11ac Wi-Fi compatibility testing profile...
-
Many people do not realize how much a funeral costs and how much these costs can vary from one provider to another. Consider the price of a traditional funeral service with visitation (excluding...
-
a. What salary would you expect for a 50-year-old individual? b. Find the 95% confidence interval for a new individual (from the same population from which the data were drawn) who is 50 years old....
-
For each of the following, say whether it is stationary or nonstationary: a. Autoregressive process. b. Random walk. c. Moving-average process. d. ARMA process.
-
Consider a Lagrangian \(L^{\prime}=L+d f / d t\), where the Lagrangian is \(L=\) \(L\left(q_{k}, \dot{q}_{k}, tight)\), and the function \(f=f\left(q_{k}, tight)\). (a) Show that...
-
Show that the function \(L^{\prime}\) given in the preceding problem must obey Lagrange's equations if \(L\) does, directly from the principle of stationary action. Lagrange's equations do not have...
-
In Example 4.8 we analyzed the case of a bead on a rotating parabolic wire. The energy of the bead was not conserved, but the Hamiltonian was: There is an equilibrium point at \(r=0\) which is...
Study smarter with the SolutionInn App