Question: Problem 2 (15 points) With the following matrix: Element Si S2 S3 S4 0 0 1 0 1 1 0 1 0 2 1 0

Problem 2 (15 points) With the following matrix: Element Si S2 S3 S4 0 0 1 0 1 1 0 1 0 2 1 0 O 1 3 0 0 1 4 0 0 1 1 a) (10 pts) Compute the minhash signature for each column if we use the following three hash functions: h1(x) = 2x + 3 mod 5; h2(x) = 3x + 2 mod 5; h3(x) = 4x + 1 mod 5. b) (5 pts) How close are the estimated Jaccard similarities for the six pairs of columns to the true Jaccard similarities? || OO
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
