Question: a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: O Show, in the limit where

a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following

a. [10 pts] Write down the formula for Dirichlet Prior Smoothing. Then, mathematically prove the following two lemmas: O Show, in the limit where document length tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to one estimated using the maximum likelihood estimate. Show, in the limit where the parameter tends to infinity, that a unigram language model smoothed with a Dirichlet prior becomes equivalent to the background language model used in the smoothing. b. [5 pts] Point out one advantage of Jelinek-Mercer smoothing over Katz-Backoff smoothing. Explain why.

Step by Step Solution

3.38 Rating (157 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

Dirichlet Prior Smoothing is a method used in language modeling to address the sparsity problem enco... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!