Question: a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: o Show, in the limit where document

 a. [10 pts] Write down the formula for Dirichlet Prior Smoothing.

a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: o Show, in the limit where document length tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to one estimated using the maximum likelihood estimate. o Show, in the limit where the parameter u tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to the background language model used in the smoothing. b. [5 pts] Point out one advantage of Jelinek-Mercer smoothing over Katz-Backoff smoothing. Explain why. a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: o Show, in the limit where document length tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to one estimated using the maximum likelihood estimate. o Show, in the limit where the parameter u tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to the background language model used in the smoothing. b. [5 pts] Point out one advantage of Jelinek-Mercer smoothing over Katz-Backoff smoothing. Explain why

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!