Question: Problem 1 . 4 . Consider the query string and documents as below: All texts should be converted to lowercase and tokenized by the whitespaces
Problem Consider the query string and documents as below: All texts should be
converted to lowercase and tokenized by the whitespaces or punctuations.
Query:covid vaping study
Documents: d: Vaping teens and young adults up to seven times more likely to contract
COVID study finds. d: Jasper Landgraab is now a young adult, he is off to university to study fine
art. d: New study shows that vaping causes anxiety and depression.
Compute the relevancy score of each document by using the tfidf scheme as defined
below.
scoreqdi
tf logftd
idf log
N
tfidft,d
t in Q
dft
where ftd is the raw frequency of the term in the document, dft is the document frequency
of the term, and N is the total number of documents.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
