In order to use association rules to distinguish high risk patients from low risk patients, we...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
In order to use association rules to distinguish high risk patients from low risk patients, we need to identify rules that have good support and high confidence on high cost population, but low support for the not-high-cost population. In particular, we are interested in identifying rules of the type: {group of diagnosis codes} -> {High cost in a future period} The variable TA2 contains the overall cost in the year following the observation period. Define a new variable that equals one if the overall cost is ≥ $20,000, and zero otherwise. Run the Association Rule algorithm again using all diagnosis variables (continue to exclude the diabetes diagnosis) and the new high-cost variable. Set the support as low as 0.1% and the keep the confidence at 10%. Identify the rules with high-cost in the consequent set (HINT: here the appearance parameter we pointed out in class may be helpful, alternatively run as you did before and sort the rules afterwards in R or Excel). a. How many rules of the form {group of diagnosis codes} -> {High cost in a future period} did you identify ? b. Briefly summarize the top 5 rules with the highest confidence (an Exhibit that summarizes the rules translating the diagnosis codes into actual conditions, their support, confidence and lift may be helpful). Point out their key characteristics. c. Briefly summarize the top 5 rules with the highest support (again an Exhibit may be helpful). Point out their key characteristics. d. Comment on the potential role of these rules as features for a prediction model. In order to use association rules to distinguish high risk patients from low risk patients, we need to identify rules that have good support and high confidence on high cost population, but low support for the not-high-cost population. In particular, we are interested in identifying rules of the type: {group of diagnosis codes} -> {High cost in a future period} The variable TA2 contains the overall cost in the year following the observation period. Define a new variable that equals one if the overall cost is ≥ $20,000, and zero otherwise. Run the Association Rule algorithm again using all diagnosis variables (continue to exclude the diabetes diagnosis) and the new high-cost variable. Set the support as low as 0.1% and the keep the confidence at 10%. Identify the rules with high-cost in the consequent set (HINT: here the appearance parameter we pointed out in class may be helpful, alternatively run as you did before and sort the rules afterwards in R or Excel). a. How many rules of the form {group of diagnosis codes} -> {High cost in a future period} did you identify ? b. Briefly summarize the top 5 rules with the highest confidence (an Exhibit that summarizes the rules translating the diagnosis codes into actual conditions, their support, confidence and lift may be helpful). Point out their key characteristics. c. Briefly summarize the top 5 rules with the highest support (again an Exhibit may be helpful). Point out their key characteristics. d. Comment on the potential role of these rules as features for a prediction model. In order to use association rules to distinguish high risk patients from low risk patients, we need to identify rules that have good support and high confidence on high cost population, but low support for the not-high-cost population. In particular, we are interested in identifying rules of the type: {group of diagnosis codes} -> {High cost in a future period} The variable TA2 contains the overall cost in the year following the observation period. Define a new variable that equals one if the overall cost is ≥ $20,000, and zero otherwise. Run the Association Rule algorithm again using all diagnosis variables (continue to exclude the diabetes diagnosis) and the new high-cost variable. Set the support as low as 0.1% and the keep the confidence at 10%. Identify the rules with high-cost in the consequent set (HINT: here the appearance parameter we pointed out in class may be helpful, alternatively run as you did before and sort the rules afterwards in R or Excel). a. How many rules of the form {group of diagnosis codes} -> {High cost in a future period} did you identify ? b. Briefly summarize the top 5 rules with the highest confidence (an Exhibit that summarizes the rules translating the diagnosis codes into actual conditions, their support, confidence and lift may be helpful). Point out their key characteristics. c. Briefly summarize the top 5 rules with the highest support (again an Exhibit may be helpful). Point out their key characteristics. d. Comment on the potential role of these rules as features for a prediction model. In order to use association rules to distinguish high risk patients from low risk patients, we need to identify rules that have good support and high confidence on high cost population, but low support for the not-high-cost population. In particular, we are interested in identifying rules of the type: {group of diagnosis codes} -> {High cost in a future period} The variable TA2 contains the overall cost in the year following the observation period. Define a new variable that equals one if the overall cost is ≥ $20,000, and zero otherwise. Run the Association Rule algorithm again using all diagnosis variables (continue to exclude the diabetes diagnosis) and the new high-cost variable. Set the support as low as 0.1% and the keep the confidence at 10%. Identify the rules with high-cost in the consequent set (HINT: here the appearance parameter we pointed out in class may be helpful, alternatively run as you did before and sort the rules afterwards in R or Excel). a. How many rules of the form {group of diagnosis codes} -> {High cost in a future period} did you identify ? b. Briefly summarize the top 5 rules with the highest confidence (an Exhibit that summarizes the rules translating the diagnosis codes into actual conditions, their support, confidence and lift may be helpful). Point out their key characteristics. c. Briefly summarize the top 5 rules with the highest support (again an Exhibit may be helpful). Point out their key characteristics. d. Comment on the potential role of these rules as features for a prediction model.
Expert Answer:
Answer rating: 100% (QA)
a A total of 223 rules of the form group of diagnosis codes High cost in a future period were identified b Rule Diagnosis Codes Support Confidence Lif... View the full answer
Related Book For
Management Accounting Information for Decision-Making and Strategy Execution
ISBN: 978-0137024971
6th Edition
Authors: Anthony A. Atkinson, Robert S. Kaplan, Ella Mae Matsumura, S. Mark Young
Posted Date:
Students also viewed these accounting questions
-
Obtain decision thresholds for this set of scenarios for Diesel and Gasoline prices: Diesel: $4.199 and Gasoline: $3.719 $4.299 and $3.709 $4.249 and $3.699. Provide a written description of this...
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
The Crazy Eddie fraud may appear smaller and gentler than the massive billion-dollar frauds exposed in recent times, such as Bernie Madoffs Ponzi scheme, frauds in the subprime mortgage market, the...
-
Please answer the following. Show your calculations for potential partial credit. Assume the expected return on the market is 14 percent and the risk-free rate is 4 percent. A.)What is the expected...
-
Use the Rolling Hills, Inc. data from Problem P14-34A. Requirements 1. Prepare the 2018 statement of cash flows by the direct method. 2. How will what you learned in this problem help you evaluate an...
-
How are foreign exchange gains and losses calculated when translating from local currencyto functional currency?
-
How a third party can intervene in an action?
-
King Company is contemplating the purchase of a smaller company, which is a distributor of Kings products. Top management of King is convinced that the acquisition will result in significant...
-
A couple thinking about retirement decide to put aside $ 1 9 , 0 0 0 each year in a savings plan that earns 1 2 . 2 0 % interest. In 9 years they will receive a gift of $ 1 4 1 , 5 0 0 that also can...
-
"Part 1: The Performance Lawn Equipment database contains data needed to develop a pro forma income statement. Dealers selling PLE products all receive 18% of sales revenue for their part of doing...
-
For each value of w, determine whether it is a solution to 53>=8w+5.
-
Critical analysis of the key literature of your chosen protected characteristics by reflecting on differing notions and opinions , and considering workplace reality in the UK. Include theory in your...
-
The company also will increase sales of its cheap clubs by 11,400 sets. The cheap clubs sell for $415 and have variable costs of $145 per set. The fixed costs each year will be $9,300,000. The...
-
Asteroids with a diameter of ~50m are estimated to collide with Earth every ~750 years. Such an event is expected to generate an airburst (atmospheric explosion) with an energy equal to 5.2 Megatons...
-
Project the future revenue growth rate for the next three years of the company. Explain the assumptions you used to predict the company's revenue growth rate(s) for the next three years. Project the...
-
Marie has had a whirlwind of a first semester of college, and now that finals are over, she is concerned about her weight gain during the past few months. After reading a bit about this issue, Marie...
-
Hydrogen bonding plays a central role in the following phenomena: (A) Ice floats in water. (B) Higher Lewis basicity of primary amines than tertiary amines in aqueous solutions. (C) Formic acid is...
-
Design an experiment to demonstrate that RNA transcripts are synthesized in the nucleus of eukaryotes and are subsequently transported to the cytoplasm.
-
Why might an organization not experience financial improvement even after using activity-based costing to identify and take action on promising opportunities for process improvements and cost...
-
Define and explain the role of measures, objectives, and targets, in the Balanced Scorecard strategy map.
-
Woodpoint Furniture Manufacturing produces various lines of pine furniture. The plant is organized so that all similar functions are performed in one area, as shown in Exhibit 7-17. Most pieces of...
-
This water jet of \(50 \mathrm{~mm}\) diameter moving at \(30 \mathrm{~m} / \mathrm{s}\) is divided in half by a "splitter" on the stationary flat plate. Calculate the magnitude and direction of the...
-
If the splitter is removed from the plate of Problem 4.75 problem and sidewalls are provided on the plate to keep the flow two dimensional, how will the jet divide after striking the plate? Data From...
-
A conical spray head is shown. The fluid is water and the exit stream is uniform. Evaluate (a) the thickness of the spray sheet at a radius of \(400 \mathrm{~mm}\) and (b) the axial force exerted by...
Study smarter with the SolutionInn App