Question: Question 1 ( 3 0 points ) : Consider the following shingle - document matrix. a ) Compute the signature matrix using 4 different permutations

Question 1(30 points): Consider the following shingle-document matrix.
a) Compute the signature matrix using 4 different permutations of your choice
b) Compute all pairwise column similarities for both the ColCol and SigSig
c) Compute the minhash signature for each column if we use the following three hash
functions: [Hint: Substitute the element index in the hash function to generate the
permutation]
[Example of modulus function:
45mod6
45=6**7+(3)
45 when divided by 6, leaves a remainder 3. So 45 mod 6=3.]
i)h1(x)=x+1mod6;
ii)h2(x)=x+4mod6;
iii)h3(x)=2x+5mod6
 Question 1(30 points): Consider the following shingle-document matrix. a) Compute the

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!