In this problem, we will combine ideas from Count- min sketch for finding heavy-hitters with the...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
In this problem, we will combine ideas from Count- min sketch for finding heavy-hitters with the Alon-Matias-Szegedy algorithm for estimating the 2 frequency moment of a stream. This will allow us to estimate heavy hitters of a stream with a tighter guarantee in certain cases. Recall that in Count-Min Sketch, we maintained d hash functions h,..., hd, corresponding to d hash tables, each of size w. For the datum that appears at time t, (it, ct) where it is the identifier, and ct is a count, for each j = [d], we increment a counter C; in entry h; (it) of the jth hash table by c. At the end of the stream, for a given identifier i, we can return fi = minjeld] C; (h; (i)) to get an estimate of fi - Etiti. In particular, setting w = 0(1/E) and d = O(log(1/8)), with probability at least 1-8, this will give an estimate fi - fil < F, where F = fi (we assume that fi 20 for all i). Consider making the following changes to the algorithm. Instead of storing just d hash functions, we instead store 2d hash functions. The second set of hash functions, 91,..., 9d maps to the range {+1}. The modification to counter C; at time t is still at entry h; (it), but now we increment it by gj(it)ct. Finally, our estimate fi is now median jeld] 95 (i)C; (h; (i)). We will obtain a guarantee which is in terms of VF2, where F = f. Let fij = 9; (i)C,(h, (i)). (a) For some given i and j, compute E[fi]. (b) For some given i and j, upper bound Var[fij]. (c) Given these two quantities, choose values of d and w, upper-bounding the probability that fij-fil 22 by a constant, and (in turn) upper-bounding the probability that fi - fil EVF by 8. (d) Compare this type of guarantee with that of Count-Min Sketch. When is each guarantee better? Give a set of frequencies (i.e., a set of fi's) illustrating where one might be better than the other. In this problem, we will combine ideas from Count- min sketch for finding heavy-hitters with the Alon-Matias-Szegedy algorithm for estimating the 2 frequency moment of a stream. This will allow us to estimate heavy hitters of a stream with a tighter guarantee in certain cases. Recall that in Count-Min Sketch, we maintained d hash functions h,..., hd, corresponding to d hash tables, each of size w. For the datum that appears at time t, (it, ct) where it is the identifier, and ct is a count, for each j = [d], we increment a counter C; in entry h; (it) of the jth hash table by c. At the end of the stream, for a given identifier i, we can return fi = minjeld] C; (h; (i)) to get an estimate of fi - Etiti. In particular, setting w = 0(1/E) and d = O(log(1/8)), with probability at least 1-8, this will give an estimate fi - fil < F, where F = fi (we assume that fi 20 for all i). Consider making the following changes to the algorithm. Instead of storing just d hash functions, we instead store 2d hash functions. The second set of hash functions, 91,..., 9d maps to the range {+1}. The modification to counter C; at time t is still at entry h; (it), but now we increment it by gj(it)ct. Finally, our estimate fi is now median jeld] 95 (i)C; (h; (i)). We will obtain a guarantee which is in terms of VF2, where F = f. Let fij = 9; (i)C,(h, (i)). (a) For some given i and j, compute E[fi]. (b) For some given i and j, upper bound Var[fij]. (c) Given these two quantities, choose values of d and w, upper-bounding the probability that fij-fil 22 by a constant, and (in turn) upper-bounding the probability that fi - fil EVF by 8. (d) Compare this type of guarantee with that of Count-Min Sketch. When is each guarantee better? Give a set of frequencies (i.e., a set of fi's) illustrating where one might be better than the other.
Expert Answer:
Answer rating: 100% (QA)
a To compute Efi for a given i and j we need to take the expectation ... View the full answer
Related Book For
Applied Regression Analysis and Other Multivariable Methods
ISBN: 978-1285051086
5th edition
Authors: David G. Kleinbaum, Lawrence L. Kupper, Azhar Nizam, Eli S. Rosenberg
Posted Date:
Students also viewed these programming questions
-
A student holds a water balloon outside of an open window and lets go. The window is 10 meters above the ground, and the balloon is falling under the acceleration of gravity, which is 9.8 m/s2. There...
-
Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...
-
1456HHSC attend all (a) In the quantum teleportation protocol, Alice and Bob are every in possession of one qubit of a couple in the joint country00i + statei. Explain how the protocol works. In...
-
In Problems 1158, perform the indicated operation, and write each expression in the standard form a + bi. 6i 3 - 4i 5
-
An amateur radio operator wishes to build a receiver that can tune a range from 14.0 MHz to 15.0 MHz. A variable capacitor has a minimum capacitance of 86 pF. (a) What is the required value of the...
-
What is the difference between tangential and radial acceleration for a point on a rotating body?
-
IFRS Framework 2018 states that relevance and faithful representation are the two fundamental qualitative characteristics of financial information. Requirement: (a) Briefly discuss what is meant by...
-
On December 1, 2016, Lynch Incorporated sold $18,000 of merchandise with terms 2/10, n/EOM. On December 11, 2016, collections were made on sales originally billed for $12,000, and on December 31,...
-
Let X and Y be jointly continuous random variables with joint PDF (cx + 1, x, y 0, x + y < 1 0, otherwise fxx(x, y) = {ex 1. Show the range of (X,Y) Rxy, in x - y plane. 2. Find the constant c. 3....
-
If the lead time in Example 12.1 changes from one week to two weeks, how is the optimal policy affected? Does the optimal order quantity change?
-
Sirius is mag - 1 . 4 . The faintest star observable at the Palomar Observatory is mag 2 3 . 6 . How many times brighter is Sirius than the faintest observable star.
-
2 3 Kayak Company budgeted the following cash receipts (excluding cash receipts from loans received) and cash payments (excluding cash payments for loan principal and interest payments) for the first...
-
Consider the following table, which gives a security analyst's expected return on two stocks in two particular scenarios for the rate of return on the market: Market Return Aggressive Stock Defensive...
-
es Pretzelmania, Incorporated, issues 7%, 10-year bonds with a face amount of $64,000 for $64,000 on January 1, 2024. Interest is paid semiannually on June 30 and December 31. Required: 1. & 2....
-
Mr. Job's salary at the end of 1995 was USS25, 000.00 per annum. At the end of each year thereafter he received an increase of 7% of the previous years' salary. Find his salary at the end of 2009
-
find the efficiency for this binary symmetric channel P(y\x)= [0.1 0.9 0.11
-
Galaxy Sports Inc. manufactures and sells two styles of All Terrain Vehicles (ATVS), the Conquistador and Hurricane, from a single manufacturing facility. The manufacturing facility operates at 100%...
-
Three forces with magnitudes of 70pounds, 40 pounds, and 60 pounds act on an object at angles of 30, 45, and 135, respectively, with the positive x-axis. Find the direction and magnitude of the...
-
This problem refers to the 1990 Census data presented in Problem 19 of Chapter 5. In addition to median selected monthly ownership costs (OWNCOST), another independent variable studied was the...
-
This question refers to the U.S. News & World Report mutual fund data presented in Problem 19 in Chapter 17. The variables described in that question were: CAT (fund category): 1 = Aggressive growth;...
-
Using the data from Problem 2 in Chapter 5 and/or the SAS output given here, answer the following questions about the separate straight-line regressions of SBP on QUET for smokers (SMK = 1) and...
-
The chapter says that a measurement system can affect a business in several ways. As an example, consider a worldwide company with headquarters in New York City and divisions in many different...
-
Under the No Child Left Behind Act, passed in 2001, every public elementary school that receives some federal funding is required to give its students certain tests of reading and math every year....
-
Consider a college transcript as the output of a measurement system. A. Identify the: a. Object being measured b. Attribute of the object being measured c. Rules for measurement d. Standard-setters...
Study smarter with the SolutionInn App