Question: For this assignment, use the following term/document matrix as the corpus.
| | A1 | A2 | A3 | B1 | B2 | C1 |
|---|---|---|---|---|---|---|
| cloud | 1 | 1 | 0 | 1 | 0 | 0 |
| rain | 5 | 5 | 0 | 0 | 0 | 0 |
| sky | 5 | 2 | 6 | 0 | 0 | 0 |
| Watson | 0 | 0 | 0 | 5 | 5 | 0 |
| blue | 3 | 0 | 4 | 2 | 2 | 0 |
| deep | 0 | 0 | 0 | 2 | 2 | 0 |
| information | 0 | 0 | 0 | 0 | 0 | 1 |
| retrieval | 0 | 0 | 0 | 0 | 0 | 1 |
The query is "clouds and rain". Assume that stemming and stopword removal will be performed on the query.
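As a rough illustration of the preprocessing assumed above, the sketch below reduces the query to the index terms `cloud` and `rain` and builds its binary vector over the eight terms of the matrix. The stopword list and the `naive_stem` helper are hypothetical stand-ins chosen for this example; a real system would use a proper stemmer (for instance Porter's) and a fuller stopword list.

```python
# Vocabulary in the row order of the term/document matrix above.
TERMS = ["cloud", "rain", "sky", "Watson", "blue", "deep",
         "information", "retrieval"]

# Assumption: a tiny illustrative stopword list, not a standard one.
STOPWORDS = {"and", "or", "the", "a", "of"}


def naive_stem(token):
    # Assumption: strip a trailing "s" from longer tokens ("clouds" -> "cloud").
    # This is only a stand-in for a real stemming algorithm.
    if len(token) > 3 and token.endswith("s"):
        return token[:-1]
    return token


def preprocess(query):
    # Lowercase, split on whitespace, drop stopwords, then stem.
    tokens = query.lower().split()
    return [naive_stem(t) for t in tokens if t not in STOPWORDS]


def binary_query_vector(query):
    # 1 if the term occurs in the processed query, 0 otherwise.
    present = set(preprocess(query))
    return [1 if t in present else 0 for t in TERMS]


print(preprocess("clouds and rain"))           # ['cloud', 'rain']
print(binary_query_vector("clouds and rain"))  # [1, 1, 0, 0, 0, 0, 0, 0]
```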
Explain how a system supporting query expansion could extend the Binary Independence Model (BIM) to give results similar to those of Latent Semantic Indexing (LSI). Demonstrate on the above query. Would you expect this to happen if we used a standard thesaurus and synonyms for query expansion?
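One possible way to approach the demonstration is sketched below, under several assumptions that are not part of the question: expansion terms are chosen by Jaccard overlap of the binarized document sets, the threshold is 0.5, and BIM scoring is simplified to equal term weights, so a document's score is just the number of query terms it contains. The names `jaccard`, `expand`, and `match_scores` are hypothetical helpers for this example.

```python
DOCS = ["A1", "A2", "A3", "B1", "B2", "C1"]

# Term counts copied from the term/document matrix in the question.
COUNTS = {
    "cloud":       [1, 1, 0, 1, 0, 0],
    "rain":        [5, 5, 0, 0, 0, 0],
    "sky":         [5, 2, 6, 0, 0, 0],
    "Watson":      [0, 0, 0, 5, 5, 0],
    "blue":        [3, 0, 4, 2, 2, 0],
    "deep":        [0, 0, 0, 2, 2, 0],
    "information": [0, 0, 0, 0, 0, 1],
    "retrieval":   [0, 0, 0, 0, 0, 1],
}

# BIM only uses presence/absence, so keep the set of documents per term.
DOC_SETS = {t: {d for d, c in zip(DOCS, row) if c > 0}
            for t, row in COUNTS.items()}


def jaccard(a, b):
    # Co-occurrence strength of two terms over their document sets.
    union = DOC_SETS[a] | DOC_SETS[b]
    return len(DOC_SETS[a] & DOC_SETS[b]) / len(union) if union else 0.0


def expand(query_terms, threshold=0.5):
    # Assumption: add any term that co-occurs strongly with a query term.
    expanded = list(query_terms)
    for t in COUNTS:
        if t not in query_terms and any(
                jaccard(t, q) >= threshold for q in query_terms):
            expanded.append(t)
    return expanded


def match_scores(query_terms):
    # Simplified BIM with equal term weights: count matching query terms.
    return {d: sum(1 for t in query_terms if d in DOC_SETS[t]) for d in DOCS}


query = ["cloud", "rain"]
print(match_scores(query))          # A3 scores 0 for the original query
print(expand(query))                # ['cloud', 'rain', 'sky']
print(match_scores(expand(query)))  # A3 now scores 1 via 'sky'
```

In this sketch the original query matches A1 and A2 (two terms each) and B1 (one term), while A3 is missed entirely. Because `sky` co-occurs with both `cloud` and `rain`, it is added to the query, and A3 then receives a nonzero score, which is the kind of behaviour LSI tends to produce from co-occurrence structure. The expansion term here comes from the corpus statistics, not from synonymy: `sky` is not a synonym of `cloud` or `rain`, so a general-purpose synonym thesaurus would probably not add it.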
