Question: Task 2: Heavy Hitter Discovery (15 Points): Given a set of n values V={v1,v2,,vn} from n users, where the values are from a bounded domain

Task 2: Heavy Hitter Discovery (15 Points): Given a set of

Task 2: Heavy Hitter Discovery (15 Points): Given a set of n values V={v1,v2,,vn} from n users, where the values are from a bounded domain D. Suppose each value vi is represented as a binary string with length m (e.g., when m=4,vi 's value is 7 , then vi=0111;vi 's value is 8 , then vi=1000 ). The naive approach of querying the frequency of each string requires 2m oracle queries and is infeasible when m is large. Now your goal is to design a LDP protocol to identify the top- k heavy hitter, i.e., the k most frequent values in V, such that it is computationally feasible to query the frequency oracle. - Straw man protocol (4 Points): A length- m value v is divided into g equal-size segments, each of length s=m/g. In this protocol, each user randomly chooses a segment to report, and the aggregator first queries the frequency of each length- s binary string in each of the g segments, and then identify the frequent patterns in each segment, where are denoted as C1,C2,,Cg. The candidate set C is the Cartesian product of {Ci} 's, i.e., C=C1C2Cg, where Cartesian product operation is defined as C1C2={cicj:ciC1 and cjC2}, and is the string concatenation operation. Finally, the aggregator queries frequencies of the strings in candidates C. Answer the following questions: Page 1 CS528 Data Security and Privacy Instructor: Binghui Wang Assignment 2 (Due: 02/19/2023) 1. 2 Points. What is the number of total frequency oracle queries using this protocol? 2. 2 Points. What is the size of the candidate set C for top- k hevay hitter discovery? - Segment pair protocol (4 Points.) This protocol improves upon the Straw man protocol. The key differences is that, instead of reporting only one segment from g segments, each user reports a pair of two randomly chosen segments. The detailed protocol is as follows: First, the aggregator identifies the frequent patterns in each of the g segments. Then, it queries, for each pair i,j of segments, the frequency for the values in CiCj and identifies the value pairs that are frequent in segments i,j. From the frequent value pairs for each pair of segments, the aggregator recovers candidates for frequent values for the whole domain, using the a priori principle that if a value vD is frequent, every pair of its segments must also be frequent. Answer the following questions: 1. 2 Points. What is the number of total frequency oracle queries using this protocol? 2. 2 Points. What is the expected number of user reports on each pair of segments? - Prefix Extending protocol (7 Points.) Assume you want to identify k=150 most frequent values using the Prefix Extending protocol. The input domain D is 15 bytes (i.e., 120 bits), and you want to limit the total number of frequency oracle queries to no more than 228. 1. 3 Points. How to design the Prefix Extending protocol? 2. 4 Points. Which frequency oracles can be used to achieve high accuracy

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Task 1: Frequency Oracle (10 Points): We want each user to report a value that has a domain of d=100 values, in a way that satisfy -local differential privacy for =ln4. - 2 Points. When using...

(a) In SystemVerilog, what is the difference between: (i) The ternary operator ? and if...then...else statements? [2 marks] (ii) always_ff and always_comb? [2 marks] (iii) Blocking, non-blocking and...

seventh pages Chapter 3 Curve Sketching How much metal would be required to make a 400-mL soup can? What is the least amount of cardboard needed to build a box that holds 3000 cm3 of cereal? The...

Briefly describe ASCII and Unicode and draw attention to any relationship between them. [3 marks] (b) Briefly explain what a Reader is in the context of reading characters from data. [3 marks] A...

answer the question clearly You are building a flight-control system for which a convincing safety case must be made. Would you assign the tasks of safety requirements engineering, test case...

data mining subject 1- summary the artical 2-what is data size 3- recoreds applied 4-what techqinecs is used 5- explain resualts EMPIRICAL STUDY ON SELECTION OF TEAM MEMBERS FOR SOFTWARE PROJECTS -...

re Regular Languages and Finite Automata (a) Let L be the set of all strings over the alphabet {a, b} that end in a and do not contain the substring bb. Describe a deterministic finite automaton...

CS 7641 CSE/ISYE 6740 Homework 3 Le Song Deadline: 11/07 Mon, 11:55pm Submit your answers as an electronic copy on T-square. No unapproved extension of deadline is allowed. Zero credit will be...

Describe how to construct the function cpo ((D E), v) of two cpos (D, vD) and (E, vE). Prove that ((D E), v) is a cpo. (You may use facts about least upper bounds provided you state them clearly.)...

\fCOMPLEX VARIABLES AND APPLICATIONS Eighth Edition James Ward Brown Professor of Mathematics The University of Michigan-Dearborn Ruel V. Churchill Late Professor of Mathematics The University of...

At the Earths surface a projectile is launched straight up at a speed of 10.0 km/s. To what height will it rise? Ignore air resistance and the rotation of the Earth.

1 8. Verify that Ento 2i n function illustrated in Figure 6, defined by f(0) = 0, and etx f(x) = is the Fourier series of the 2-periodic sawtooth 2 2 KIN - 0 x 2 x 2 Note that this function is not...

Refer to the results in problem D.15, problem

Consider a 12-year annuity-immediate with payments of 8, 9, 10, ..., 19. Which of the following formulas gives the PV of this annuity? Possible Answers A PV = 7+(1a) 12 B PV = 80 121 + (Ia) 11] CPV =...

A How can hearing people become better listeners when they communicate with deaf people? How can deaf people improve their listening skills in the hearing world?

We know that manners are culturally bound (rude behavior in one culture may be acceptable behavior in another). Taking into account your culture and co-cultures (including age, gender, and so on), do...

A What kinds of activities do you know distract you and affect your ability to listen? Are there any tasks that you feel you can do without affecting your listening?