Question: Consider the following dataset with two quasi-identifiers (age and zip) and one sensitive attribute (disease):...
Question:
Consider the following dataset with two quasi-identifiers (age and zip) and one sensitive attribute (disease):

sn   age  zip    disease
---  ---  -----  -------------
1    29   39056  Heart disease
2    34   15010  Autism
3    40   58014  Autism
4    31   39056  Alzheimer
5    40   56017  Heart disease
6    32   51100  AIDS
7    41   82030  Autism
8    46   37060  Heart disease
9    37   39010  Anorexia
10   33   41038  Anorexia
11   47   39056  Alzheimer
12   47   26010  Heart disease

Apply the k-anonymization algorithm (greedy partitioning) discussed in class. The partitioning must be done w.r.t. the two quasi-identifiers zip and age (in this order). Show all the steps; showing only the final answer will result in zero score.

Part A: [10pts] Apply the algorithm for k = 2.
A. What is the number of the resulting ECs?
B. Compute the Discernibility Metric (CDM).
C. Compute the ILoss/Utility (w.r.t. age).
D. Indicate/Compute the distinct and entropy ℓ-diversity.

Part B: [10pts] Repeat part A for k = 3.

Part C: [10pts] Repeat part A for k = 4.

Part D: [5pts] Plot CDM and ILoss/Utility vs. the different k values (2, 3, and 4). State your observations/findings.
Expert Answer:
To implement the k-anonymization algorithm (greedy partitioning) on the given dataset with the quasi-identifiers age and zip, we need to follow the specifi...
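The metrics asked for in parts A to C can be computed mechanically once the equivalence classes (ECs) are known. The sketch below is only an illustration under stated assumptions: it uses the common textbook definitions (CDM as the sum of squared EC sizes, ILoss for age as the record-weighted width of each EC's age range divided by the width of the age domain, distinct ℓ-diversity as the smallest number of distinct sensitive values in any EC, and entropy ℓ-diversity as the exponential of the smallest per-EC entropy). The definitions used in class may differ, the function names are hypothetical, and the example partition is made up rather than the output of the greedy partitioning algorithm.

```python
import math
from collections import Counter

# A record is a tuple (age, zip, disease); an EC is a list of records.
# All names and the sample data below are hypothetical illustrations.

def discernibility(ecs):
    """C_DM under the common definition: sum of squared EC sizes."""
    return sum(len(ec) ** 2 for ec in ecs)

def iloss_age(ecs, domain_min, domain_max):
    """Record-weighted average of (EC age range) / (age domain range)."""
    span = domain_max - domain_min
    total_records = sum(len(ec) for ec in ecs)
    loss = 0.0
    for ec in ecs:
        ages = [record[0] for record in ec]
        loss += len(ec) * (max(ages) - min(ages)) / span
    return loss / total_records

def distinct_l(ecs):
    """Smallest number of distinct sensitive values found in any EC."""
    return min(len({record[2] for record in ec}) for ec in ecs)

def entropy_l(ecs):
    """exp(min entropy over ECs): the largest l with entropy >= log(l) in every EC."""
    def entropy(ec):
        counts = Counter(record[2] for record in ec)
        n = len(ec)
        return -sum((c / n) * math.log(c / n) for c in counts.values())
    return math.exp(min(entropy(ec) for ec in ecs))

# Made-up partition with two ECs (not the assignment's answer).
example_ecs = [
    [(30, "A", "flu"), (34, "A", "cold")],
    [(40, "B", "flu"), (48, "B", "flu"), (50, "B", "cold")],
]

cdm = discernibility(example_ecs)                # 2^2 + 3^2
iloss = iloss_age(example_ecs, 30, 50)           # age domain assumed to be [30, 50]
l_distinct = distinct_l(example_ecs)
l_entropy = entropy_l(example_ecs)
print(cdm, round(iloss, 3), l_distinct, round(l_entropy, 3))
```

For Part D, the same functions could be evaluated on the partitions obtained for k = 2, 3, and 4, and the resulting CDM and ILoss values plotted against k.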