Question: 1 . ( 5 pts ) Implement in Python a class to model hash functions from a certain family H of hash functions. ( Call

1 .

(

5

pts

)

Implement in Python a class to model hash functions from a certain family H of hash functions.

(

Call this class HashFamily.

)

The constructor for H should take n and p as arguments. The constructor should generate a random set of p distinct indices from

{

0

,

1

,

.

.

.

,

-

1

}

and remember this set, call it S

.

This class should have a method hash

(

)

which computes the hash value of any n

-

bit number x as follows. The method first extracts

{

[

]

: i in S

}

.

It then formulates the p

-

bit number obtained from this set in order of increasing index and returns this p

-

bit number in decimal.

2 .

(

5

pts

)

Implement in Python a Bloom Filter

(

)

as a class.

1 .

The constructor should take m

,

,

and n as arguments, where m and k are as done in class and n is the number of bits in an input.

(

Its okay to limit m to be a power of

2 .

)

The constructor should then create k random hash functions as HashFamily objects with arguments n

=

n and p

=

log

_

2

(

)

.

It maintains these hash functions in its state.

2 .

Support insert and lookup as done in class.

3 .

Support a bash insert method

-

a wrapper around insert

-

as a convenience method

(

see next question

)

.

3 .

(

7

pts

)

In this question, the aim is to empirically study how various parameter settings impact the false positive rate on simulated data.

1 .

Generate

10000

64

-

bit numbers at random and store this list somewhere. Call it S

.

2 .

for m in

[

64

,

128

,

256

,

512

,

1024

,

2048

,

.

.

.

,

65536

]

1 .

for k in

[

1

,

2

,

4

,

8

,

.

.

.

,

/

2

]

1 .

Create a BF with params m

=

,

=

,

=

64 .

2 .

for q in

[

,

2

,

4

,

8

,

16

,

32

,

64

]

1 .

Generate q n

-

bit numbers and batch insert them all into the BF

.

(

Note that these are inserted into an existing BF

.

)

2 .

Generate

1000

random n

-

bit numbers from scratch and lookup each of them in the BF

.

Calculate the proportion of these

1

K numbers on which the BF answers YES. The higher this proportion the higher the false positive rate.

3 .

Report your results in a table whose rows correspond to

(

,

,

)

one row per triplet.

4 .

Analyze your results to draw meaningful conclusions about the impact of the various parameter settings on the false positive rate.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

( 5 pts ) Implement in Python a class to model hash functions from a certain family H of hash functions. ( Call this class HashFamily. ) The constructor for H should take n and p as arguments. The...

*******PLEASE ANSWER IN PYTHON ONLY********* Learner Objectives ----------------- At the conclusion of this programming assignment, participants should be able to: Implement hash tables and hash...

*******PLEASE ANSWER IN PYTHON ONLY********* PA4 Maps (100 pts) Due: Learner Objectives ----------------- At the conclusion of this programming assignment, participants should be able to: Implement...

********PLEASE ANSWER IN PYTHON ONLY********* PA4 Maps (100 pts) Due: Learner Objectives ----------------- At the conclusion of this programming assignment, participants should be able to: Implement...

*******PLEASE ANSWER IN PYTHON ONLY********* Learner Objectives ----------------- At the conclusion of this programming assignment, participants should be able to: Implement hash tables and hash...

PA4 Maps (100 pts) Due: Learner Objectives ----------------- At the conclusion of this programming assignment, participants should be able to: Implement hash tables and hash functions Linear probing...

I am having trouble implementing hash tables in python. Lines 111-394 are all that need to be modified, any help would be appreciated. #!/usr/bin/python3 import unittest import math from enum import...

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

use python do the work in every space where it says #YOUR CODE HERE A Sock Drawer Let us model a sock drawer. How does a sock drawer work? Well, when you do the washing you end up with a bunch of...

Need help getting started on these questions. I am supposed to add code where it says "implement me" and write the answer where it says answer in one or two line. Need to fill in the "Implement me"...

The matrices A and B +=(1) A B = are commuting (i. e. AB = BA). Find x and y. [Marks= 5]

Which category of e-business (B2B, B2C, G2B, or G2C) best describes each of the following items? a. Buying materials for professional practice from www.aicpa.org b. Electronic reporting of state...

A firm's P / E ratio is high compared to its competitors. What could this indicate? Investors expect high growth rates or the stock is overvalued Investors expect slowing growth rates or the stock is...

In unsolicited proposals convincing readers that a problem or an opportunity exists can be considered the _ _ _ _ _ _ _ _ . A ) scope B ) background or statement of the problem C ) solution D )...

Why are employees considering union representation?

Explain the use of ombudspersons and alternative dispute resolution.

What is the total annual turnover rate?