Question: Artificial Intelligence Nave Bayes is frequently applied to categorizing text (e.g. documents, web pages) based on the presence or absence of different words. Assume we

Artificial Intelligence

Nave Bayes is frequently applied to categorizing text (e.g. documents, web pages) based on the presence or absence of different words. Assume we want to categorize science texts into the following categories: Physics, Biology, and Chemistry. The following probabilities have been estimated from analyzing a corpus of pre-classified web pages gathered from Yahoo:

Artificial Intelligence Nave Bayes is frequently applied to categorizing text (e.g. documents,

Assuming the probability of each evidence word is independent given the category of the text, compute the posterior probability for each of the possible categories for each of the following short texts. Assume the categories are disjoint and complete for this application. Assume that the words are first stemmed to reduce them to their base form (atoms -> atom, forces -> force, protons -> proton). a.) The carbon atom is the foundation of life on earth. b.) The carbon atom contains twelve protons. c.) String theory attempts to unify all the forces on earth.

P(m) P('atomm) P('carbon' m) POproton' m) P(life' m) P(-earth' m) P(-force' m) Physics 0.35 0.10 0.005 0.05 0.001 0.005 0.05 Biology 0.40 0.01 0.03 0.001 0.10 0.006 0.005 Chemistry 0.25 0.20 0.05 0.05 0.008 0.003 0.008

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!