Question:

2. Consider a term-document matrix for four words in three documents, shown in Table 3. The whole document set has N = 20 documents, and for each of the four words, the document frequency df_t is shown in Table 4.

Table 3: Term-document Matrix

term        Doc1  Doc2  Doc3
car           27    14    24
insurance      3    18     0
auto           0    33    29
best          14

Table 4: Document Frequency

term        df_t
car
insurance      6
auto          10
best          16

(a) Compute the tf-idf weights for each word (car, auto, insurance, and best) in Doc1, Doc2, and Doc3. [6 pts]

(b) Use the tf-idf weights you get from (a) to represent each document as a vector, and calculate the cosine similarities between these three documents. [4 pts]
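A minimal sketch of the computation that parts (a) and (b) ask for, under two assumptions that are not stated in the question: the weight is raw term frequency times log10(N / df_t), and each document vector lists the terms in the order given. The names tf_idf and cosine are illustrative only. Because the document frequency of car and the full row for best are not legible above, the example uses only the insurance and auto rows; the remaining terms can be added to both dictionaries once their values are known.

```python
import math

def tf_idf(counts, df, n_docs):
    """Return {term: [weight per document]} using tf * log10(N / df).

    Assumption: raw term frequency and a base-10 idf; other variants
    (for example 1 + log10(tf)) would give different numbers.
    """
    return {
        term: [tf * math.log10(n_docs / df[term]) for tf in tfs]
        for term, tfs in counts.items()
    }

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

N = 20  # size of the whole document set, from the question

# Only the rows that are fully legible in Tables 3 and 4.
counts = {"insurance": [3, 18, 0], "auto": [0, 33, 29]}
df = {"insurance": 6, "auto": 10}

weights = tf_idf(counts, df, N)

# One vector per document, with terms in dictionary order.
terms = list(counts)
doc_vectors = [[weights[t][d] for t in terms] for d in range(3)]

# Pairwise similarities keyed by 1-based document numbers.
similarities = {
    (i + 1, j + 1): cosine(doc_vectors[i], doc_vectors[j])
    for i in range(3)
    for j in range(i + 1, 3)
}

for term in terms:
    print(term, [round(w, 3) for w in weights[term]])
for pair, sim in similarities.items():
    print(pair, round(sim, 3))
```

With only two terms, the similarities printed here are not the answer to part (b); they illustrate the procedure, and the values change once car and best are included.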
