Question: Text categorization: given the following document - term matrix: ( the value in the matrix represents the frequency of a specific term in that document

Text categorization: given the following document-term matrix: (the value in the
matrix represents the frequency of a specific term in that document)
Assume that documents have been manually assigned to two pre-specified categories
as follows: Class_1={Doc1, Doc2, Doc5}, Class_2={Doc3, Doc4, Doc6, Doc7}
(a) Using Nave Bayes Multinomial Model to respectively calculate how Doc 8 and
Doc 9 given above will be classified. Please use add-one smoothing to process the
conditional probabilities in the calculation.
Feature Selection: given the following document-term matrix the above problem: (1)
calculate the mutual information between each term and the two the classes and select
the top four most important terms (features) for each class; (2) use the top four
features selected for each class, re-categorize Doc8 and Doc9 based on Nave
Bayes Multinomial Model.
Text categorization: given the following document

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!