Question: 1 - Compute the bigram count table, C(w2 | w1), for the following: A skunk sat on a stump. The
Compute the bigram count table, C(w2 | w1), for the following: "A skunk sat on a stump. The stump thunk the skunk stunk, the skunk thunk the stump stunk." Put w1 in the left-hand column and w2 in the top row. Include punctuation, clitics, and sentence start and end markers as individual tokens.
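As a rough sketch of how the count table could be built, the snippet below tokenizes the two sentences and tallies adjacent pairs. Several choices here are assumptions, not part of the question: tokens are lowercased (so "A" and "a" share a row), the markers are written `<s>` and `</s>`, and the helper names `tokenize` and `bigram_counts` are made up for illustration.

```python
import re
from collections import Counter

TEXT = ("A skunk sat on a stump. The stump thunk the skunk stunk, "
        "the skunk thunk the stump stunk.")

def tokenize(text):
    """Lowercase the text, split punctuation into its own tokens,
    and wrap every sentence (ended by a period) in <s> ... </s>."""
    tokens, sentence = [], []
    for tok in re.findall(r"\w+|[^\w\s]", text.lower()):
        sentence.append(tok)
        if tok == ".":  # assumption: every sentence ends with a period
            tokens.extend(["<s>"] + sentence + ["</s>"])
            sentence = []
    return tokens

def bigram_counts(tokens):
    """Count (w1, w2) pairs, skipping the pair that would span two sentences."""
    return Counter(
        (w1, w2) for w1, w2 in zip(tokens, tokens[1:]) if w1 != "</s>"
    )

tokens = tokenize(TEXT)
counts = bigram_counts(tokens)

# Print the table with w1 down the left and w2 across the top.
vocab = sorted(set(tokens))
print("\t".join([""] + vocab))
for w1 in vocab:
    print("\t".join([w1] + [str(counts[(w1, w2)]) for w2 in vocab]))
```

Under these assumptions, `counts[("the", "skunk")]` and `counts[("the", "stump")]` are both 2, and every bigram from the first sentence has count 1.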
Compute the bigram probability table, P(w2 | w1), for the phrase "A skunk sat on a stump." assuming the following overall unigram counts:
C(a), C(skunk), C(sat), C(on), C(stump), C(the), C(thunk), C(stunk), C
Assume there are sentences in the corpus, and they all end with a period.
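The maximum-likelihood estimate divides each bigram count by the unigram count of its first word: P(w2 | w1) = C(w1 w2) / C(w1). A minimal sketch follows. The unigram values in `example_unigrams` are hypothetical placeholders, since the actual counts are not preserved in the question text, and taking the bigram counts from the phrase alone is also an assumption. The count for `<s>` would normally be the number of sentences in the corpus, which is presumably why the question specifies it.

```python
from collections import Counter

def bigram_prob_table(bigram_counts, unigram_counts):
    """MLE estimate: P(w2 | w1) = C(w1 w2) / C(w1)."""
    return {
        (w1, w2): count / unigram_counts[w1]
        for (w1, w2), count in bigram_counts.items()
    }

phrase = ["<s>", "a", "skunk", "sat", "on", "a", "stump", ".", "</s>"]
phrase_bigrams = Counter(zip(phrase, phrase[1:]))

# HYPOTHETICAL unigram counts, for illustration only; substitute the
# values given in the assignment.
example_unigrams = {
    "<s>": 10, "a": 20, "skunk": 5, "sat": 4,
    "on": 8, "stump": 5, ".": 10,
}

probs = bigram_prob_table(phrase_bigrams, example_unigrams)
for (w1, w2), p in probs.items():
    print(f"P({w2} | {w1}) = {p:.3f}")
```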
Compute the probability and perplexity of the sentence in Question using the bigram approximation.
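Under the bigram approximation, the sentence probability is the product of P(w_i | w_{i-1}) over the sentence, and perplexity is that probability raised to the power -1/N. The sketch below works in log space to avoid underflow. It assumes N counts every predicted token (including `</s>` but not `<s>`), which is one common convention; the course may use a different N. The function names and the toy probabilities are illustrative only.

```python
import math

def sentence_logprob(tokens, probs):
    """Sum of log P(w2 | w1) over adjacent token pairs."""
    return sum(math.log(probs[(w1, w2)]) for w1, w2 in zip(tokens, tokens[1:]))

def sentence_prob(tokens, probs):
    return math.exp(sentence_logprob(tokens, probs))

def perplexity(tokens, probs):
    """PP(W) = P(W) ** (-1 / N), with N = number of predicted tokens."""
    n = len(tokens) - 1  # assumption: <s> is not counted, </s> is
    return math.exp(-sentence_logprob(tokens, probs) / n)

# Toy check with made-up probabilities: three bigrams, each with P = 0.5.
toy_tokens = ["<s>", "a", "b", "</s>"]
toy_probs = {("<s>", "a"): 0.5, ("a", "b"): 0.5, ("b", "</s>"): 0.5}
print(sentence_prob(toy_tokens, toy_probs))
print(perplexity(toy_tokens, toy_probs))
```

In the toy check the sentence probability is 0.5 cubed, 0.125, and the perplexity is 2, as expected for a model that is always choosing between two equally likely continuations.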
Smoothing
Smooth the count table you calculated in Question using Laplace smoothing, and recalculate the probability table as well. Assume V
Recalculate the probability and perplexity of the first sentence in Question using the smoothed table.
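Laplace (add-one) smoothing adds 1 to every bigram count and V to every denominator: P(w2 | w1) = (C(w1 w2) + 1) / (C(w1) + V). A minimal sketch is below. The vocabulary size is left as a parameter because the value of V is cut off in the question text, and the numbers in the usage lines are hypothetical.

```python
def laplace_count(w1, w2, bigram_counts):
    """Add-one smoothed count: C(w1 w2) + 1, with unseen bigrams starting at 0."""
    return bigram_counts.get((w1, w2), 0) + 1

def laplace_prob(w1, w2, bigram_counts, unigram_counts, vocab_size):
    """Add-one smoothed estimate: (C(w1 w2) + 1) / (C(w1) + V)."""
    return laplace_count(w1, w2, bigram_counts) / (
        unigram_counts.get(w1, 0) + vocab_size
    )

# HYPOTHETICAL toy numbers, for illustration only.
toy_bigrams = {("a", "skunk"): 1}
toy_unigrams = {"a": 2}
toy_v = 3

seen = laplace_prob("a", "skunk", toy_bigrams, toy_unigrams, toy_v)    # (1 + 1) / (2 + 3)
unseen = laplace_prob("a", "stump", toy_bigrams, toy_unigrams, toy_v)  # (0 + 1) / (2 + 3)
print(seen, unseen)
```

Because unseen bigrams now receive a nonzero probability, the smoothed sentence probability and perplexity can be recomputed with the same product and -1/N formulas used for the unsmoothed table.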
