Question: a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: O Show, in the limit where
a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: O Show, in the limit where document length tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to one estimated using the maximum likelihood estimate. Show, in the limit where the parameter tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to the background language model used in the smoothing. b. [5 pts] Point out one advantage of Jelinek-Mercer smoothing over Katz-Backoff smoothing. Explain why.
Step by Step Solution
3.38 Rating (157 Votes )
There are 3 Steps involved in it
Dirichlet Prior Smoothing is a method used in language modeling to address the sparsity problem enco... View full answer
Get step-by-step solutions from verified subject matter experts
