Question:
A library management system is used to store information about books. Each book can
be associated with a set of tags. The system allows users to search for books by tag.
Consider the following book collection which has books:
Let Dict be a dictionary which consists of tags: Dict = {Science Fiction, Adventure, NonFiction}.
(a) Denote by TF the tag frequency of a tag in a book. Please fill out the
blank cells in the following table, i.e., give the TF values. Each value should
be put into the cell specified by its tag and its book. Note: you can copy the form
to your answer sheet and then fill it out.
(b) Recall that the inverse document frequency (IDF) of a tag is defined from two
quantities: the number of books in the collection, and the number of books from
the collection that contain the tag. Please compute the idf values of the two tags asked for.
(c) Tag frequency-inverse document frequency (TF-IDF) takes both tag frequency (TF) and inverse document frequency (IDF) into consideration. For the book collection, a tag's TF-IDF value on a book is defined as the product of the tag's TF value on that book and the tag's IDF value. Please compute the two TF-IDF values asked for.
(d) Using TF-IDF, a book can be represented by a multidimensional vector of the TF-IDF values of all tags in the dictionary. Compute the vector for each book.
(e) Similarly, any given query can be represented by a multidimensional vector. Compute the vector for the query "Science Fiction".
(f) Using document vectors, we can compute the relevance score of each book for a given query as the cosine similarity between the book vector and the query vector. The retrieval result of a query is the ranking of the books in decreasing order of relevance score. What is the result of the query "Science Fiction"?
(g) What is the result of the query "Science"? Explain why. Which solution do you suggest to include B and B in the result?
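
The pipeline described in parts (a) through (f) can be sketched in Python as follows. This is only an illustration under stated assumptions: the book collection used here is hypothetical (it is not the collection from the question), TF is assumed to be the raw count of a tag in a book's tag list, and IDF is assumed to be log10(N / n), where N is the number of books and n the number of books containing the tag. The question's own IDF formula may differ. The names `books`, `dictionary`, `tf`, `idf`, `tfidf_vector`, `cosine`, and `rank` are made up for this sketch.

```python
import math

# Hypothetical collection (an assumption, NOT the collection from the question).
books = {
    "B1": ["Science Fiction", "Adventure"],
    "B2": ["Science Fiction"],
    "B3": ["NonFiction", "Adventure"],
    "B4": ["NonFiction"],
}
dictionary = ["Science Fiction", "Adventure", "NonFiction"]


def tf(tag, tags):
    # Assumed TF: raw count of the tag in the book's (or query's) tag list.
    return tags.count(tag)


def idf(tag, collection):
    # Assumed IDF: log10(N / n); defined as 0.0 here if no book carries the tag.
    n_total = len(collection)
    n_with_tag = sum(1 for tags in collection.values() if tag in tags)
    return math.log10(n_total / n_with_tag) if n_with_tag else 0.0


def tfidf_vector(tags, collection, vocab):
    # One TF-IDF component per dictionary tag, in dictionary order.
    return [tf(t, tags) * idf(t, collection) for t in vocab]


def cosine(u, v):
    # Cosine similarity; returns 0.0 when either vector is all zeros.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query_tags, collection, vocab):
    # Score every book against the query and sort by decreasing score.
    q = tfidf_vector(query_tags, collection, vocab)
    scores = {
        name: cosine(tfidf_vector(tags, collection, vocab), q)
        for name, tags in collection.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


ranking = rank(["Science Fiction"], books, dictionary)
for name, score in ranking:
    print(name, round(score, 3))
```

In this sketch, a query whose tag does not appear in the dictionary (for example `rank(["Science"], books, dictionary)`) produces an all-zero query vector, so every book receives a score of 0.0 and the ranking carries no information.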
