Question:

Consider a document-term matrix, where $tf_{ij}$ is the frequency of the ith word (term) in the jth document and m is the number of documents. Consider the variable transformation that is defined by

$$tf'_{ij} = tf_{ij} \cdot \log\frac{m}{df_i}$$

where $df_i$ is the number of documents in which the ith term appears, known as the document frequency of the term. This transformation is known as the inverse document frequency transformation.
(a) What is the effect of this transformation if a term occurs in one document? In every document?
(b) What might be the purpose of this transformation?
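
To make the effect of the transformation concrete, here is a minimal Python sketch. It assumes the transformation has the standard inverse document frequency form tf'_ij = tf_ij * log(m / df_i). The function name `idf_transform`, the toy matrix, and the choice to give a term that appears in no document a weight of 0 are illustrative assumptions, not part of the exercise.

```python
import math


def idf_transform(tf):
    """Apply tf'_ij = tf_ij * log(m / df_i) to a matrix of term frequencies.

    tf[i][j] is the frequency of term i in document j (rows are terms).
    """
    m = len(tf[0])  # number of documents
    out = []
    for row in tf:
        # Document frequency: number of documents containing this term.
        df = sum(1 for count in row if count > 0)
        # Assumption: a term that appears in no document gets weight 0.
        weight = math.log(m / df) if df > 0 else 0.0
        out.append([count * weight for count in row])
    return out


# Hypothetical toy data: 3 terms (rows) by 4 documents (columns).
tf = [
    [5, 0, 0, 0],  # term appears in one document
    [2, 1, 3, 1],  # term appears in every document
    [1, 0, 2, 0],  # term appears in half of the documents
]
weighted = idf_transform(tf)
```

Under this assumed form, a term that occurs in exactly one document is scaled by log(m), the largest possible weight, while a term that occurs in every document is scaled by log(1) = 0 and so drops out. Terms in between are down-weighted in proportion to how widely they are spread across documents.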

Related Book:

Introduction to Data Mining

ISBN: 978-0321321367

1st edition

Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar
