Question: Text categorization: given the following document-term matrix (the value in the matrix represents the frequency of a specific term in that document)
Assume that documents have been manually assigned to two pre-specified categories
as follows: Class: Doc, Doc, Doc; Class: Doc, Doc, Doc, Doc
(a) Use the Naive Bayes Multinomial Model to calculate how Doc and Doc given
above will each be classified. Please use add-one smoothing when estimating the
conditional probabilities in the calculation.
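The calculation asked for in this part can be sketched in code. The snippet below is a minimal illustration of multinomial Naive Bayes with add-one smoothing, where P(t|c) = (count of t in class c + 1) / (total term count in class c + vocabulary size). The matrix values are not reproduced in this question, so the terms, counts, class names "A" and "B", and the function names `train_multinomial_nb` and `classify` are all made up for illustration and are not the data from the problem.

```python
import math

def train_multinomial_nb(docs, labels):
    """docs: list of {term: frequency}; labels: parallel list of class names."""
    vocab = sorted({t for d in docs for t in d})
    log_prior, log_cond = {}, {}
    for c in sorted(set(labels)):
        class_docs = [d for d, y in zip(docs, labels) if y == c]
        # Prior: fraction of training documents in class c.
        log_prior[c] = math.log(len(class_docs) / len(docs))
        # Total frequency of each term over all documents of class c.
        totals = {t: sum(d.get(t, 0) for d in class_docs) for t in vocab}
        # Add-one smoothing: +1 per term, so +|V| in the denominator.
        denom = sum(totals.values()) + len(vocab)
        log_cond[c] = {t: math.log((totals[t] + 1) / denom) for t in vocab}
    return log_prior, log_cond

def classify(doc, log_prior, log_cond):
    """Return (best class, {class: log score}); unseen terms are skipped."""
    scores = {}
    for c in log_prior:
        scores[c] = log_prior[c] + sum(
            n * log_cond[c][t] for t, n in doc.items() if t in log_cond[c]
        )
    return max(scores, key=scores.get), scores

# Hypothetical toy data (NOT the matrix from the question).
train_docs = [
    {"ball": 2, "goal": 1},
    {"ball": 1, "team": 2},
    {"vote": 2, "team": 1},
    {"vote": 1, "law": 2},
]
train_labels = ["A", "A", "B", "B"]

log_prior, log_cond = train_multinomial_nb(train_docs, train_labels)
predicted, scores = classify({"ball": 1, "team": 1}, log_prior, log_cond)
print(predicted, scores)
```

Working in log space avoids underflow when documents are long; the class with the larger log score is the predicted one. To apply this to the actual problem, the toy dictionaries would be replaced by the rows of the given matrix.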
(b) Feature selection: given the document-term matrix from the above problem,
calculate the mutual information between each term and the two classes and select
the top four most important terms (features) for each class; then, using the top
four features selected for each class, recategorize Doc and Doc based on the Naive
Bayes Multinomial Model.
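The feature-selection step can be sketched as follows. This is only one possible reading of the task: it assumes the common presence-based definition of mutual information, I(U;C) = sum over e_t, e_c in {0,1} of (N_{e_t e_c} / N) log2(N N_{e_t e_c} / (N_{e_t .} N_{. e_c})), where a term counts as present when its frequency is above zero. The course may intend a different variant. The toy counts, class names, and the function names `mutual_information` and `top_k_terms` are made up for illustration and are not the data from the problem.

```python
import math

def mutual_information(docs, labels, term, cls):
    """Presence-based MI (in bits) between `term` and membership in `cls`."""
    n = len(docs)
    # counts[(e_t, e_c)]: documents with term present/absent and in/out of cls.
    counts = {(et, ec): 0 for et in (0, 1) for ec in (0, 1)}
    for d, y in zip(docs, labels):
        et = 1 if d.get(term, 0) > 0 else 0
        ec = 1 if y == cls else 0
        counts[(et, ec)] += 1
    mi = 0.0
    for (et, ec), n_joint in counts.items():
        if n_joint == 0:
            continue  # the 0 * log(0) terms are taken to be 0
        n_t = counts[(et, 0)] + counts[(et, 1)]  # marginal for term indicator
        n_c = counts[(0, ec)] + counts[(1, ec)]  # marginal for class indicator
        mi += (n_joint / n) * math.log2(n * n_joint / (n_t * n_c))
    return mi

def top_k_terms(docs, labels, cls, k):
    """Terms ranked by MI with `cls`, ties broken alphabetically."""
    vocab = sorted({t for d in docs for t in d})
    scored = [(mutual_information(docs, labels, t, cls), t) for t in vocab]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [t for _, t in scored[:k]]

# Hypothetical toy data (NOT the matrix from the question).
train_docs = [
    {"ball": 2, "goal": 1},
    {"ball": 1, "team": 2},
    {"vote": 2, "team": 1},
    {"vote": 1, "law": 2},
]
train_labels = ["A", "A", "B", "B"]

selected = top_k_terms(train_docs, train_labels, "A", 2)
print(selected)
```

Under this definition, with exactly two classes the indicator for one class is the complement of the indicator for the other, so every term gets the same score for both classes and the two rankings coincide. After selection, the documents would be reduced to the chosen terms and the smoothed Naive Bayes estimates recomputed over that smaller vocabulary before recategorizing.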
