Question: 2. Consider we have a term-document matrix for four words in three documents shown in Table 3. The whole document set has N = 20

2. Consider we have a term-document matrix for four words in three documents shown in Table 3. The whole document set has N = 20 documents, and for each of the four words, the document frequency dl, is shown in Table 4. term-document Docl Doc2 Doc3 car 27 14 24 insurance 3 18 0 auto 0 33 29 bet 14 Table 3: Term-document Matrix insurance | 6 auto 10 bet 16 Table 4: Document Frequency (a) Compute the t/-id/ weights for each word car, auto insurance and best in Doel, Doc2, and Doc3. (6 pts] (b) Use the t-id/ weight you get from (a) to represent each document with a vector and calculate the cosine similarities between these three documents. 4 pts 2. Consider we have a term-document matrix for four words in three documents shown in Table 3. The whole document set has N = 20 documents, and for each of the four words, the document frequency dl, is shown in Table 4. term-document Docl Doc2 Doc3 car 27 14 24 insurance 3 18 0 auto 0 33 29 bet 14 Table 3: Term-document Matrix insurance | 6 auto 10 bet 16 Table 4: Document Frequency (a) Compute the t/-id/ weights for each word car, auto insurance and best in Doel, Doc2, and Doc3. (6 pts] (b) Use the t-id/ weight you get from (a) to represent each document with a vector and calculate the cosine similarities between these three documents. 4 pts
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
