Question: Theme: Understanding Basic Information Retrieval System Task: Index the document collection provided for this assignment, and compare the retrieval performance using different weighting methods. Weighting

 Theme: Understanding Basic Information Retrieval System Task: Index the document collectionprovided for this assignment, and compare the retrieval performance using different weightingmethods. Weighting Method: (i) TFIDF, (ii) BM25 You may choose to use

Theme: Understanding Basic Information Retrieval System Task: Index the document collection provided for this assignment, and compare the retrieval performance using different weighting methods. Weighting Method: (i) TFIDF, (ii) BM25 You may choose to use any variant of the above two weighting methods. Evaluation Matrix: Precision@k, AveragePrecision@k, MAP@k, Recall@k, AverageRecall@k, MAR@k. You also compare the weighting methods by plotting MAP@k vs MAR@k. The values of k can be considered k=10,20,30, and so on as appropriate. You would need to report the performance of the systems by considering the above two scenarios. (i) evaluate considering only the respective body of the title query. You will have only one relevant document for this setup. Do for all titles. (ii) evaluate considering top 30 retrieved documents with manual annotation. You may do this for at least 5 queries of your choice. Retrieval System: There are many open source retrieval system. You may choose any retrieval system of your choice. Some of such systems are (i)Whoosh, (ii) solr, (iii) elasticsearch. Considering the followup assignments, you may choose Whoosh, but not necessary

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!