(Exercise 3.4 & 3.7 of chapter 3 of SLP book) Given the following corpus:
<s> I am Sam </s>
<s> Sam I am </s>
<s> Sam I like </s>
<s> Sam I do like </s>
<s> do I like Sam </s>
(a) Using a bigram language model with add-one smoothing, what is P(Sam|am)?
Include <s> and </s> in your counts just like any other token.
(b) If we use linear interpolation smoothing between a maximum-likelihood bigram
model and a maximum-likelihood unigram model with λ1 = 1/2 and λ2 = 1/2, what is
P(Sam|am)? Include <s> and </s> in your counts just like any other token.
(c) Which of the following sentences gets a higher probability with this model?
I am Sam
Sam I am
(d) What is the perplexity of this model if evaluated on a toy test dataset that contains
only one sentence?
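
For anyone working through parts (a) to (d), here is a minimal Python sketch of the counting involved. It rests on several assumptions that are not stated in the post: every corpus line is wrapped in <s> and </s>, bigrams are counted within a sentence only, the vocabulary size V includes both markers, and perplexity is normalized by the number of words plus </s>. The function names (p_add_one, p_interp, sentence_prob, perplexity) are illustrative, not taken from the book.

```python
from collections import Counter

# Assumed corpus: <s> and </s> restored on every line.
corpus = [
    "<s> I am Sam </s>",
    "<s> Sam I am </s>",
    "<s> Sam I like </s>",
    "<s> Sam I do like </s>",
    "<s> do I like Sam </s>",
]

sentences = [line.split() for line in corpus]
unigrams = Counter(tok for sent in sentences for tok in sent)
# Bigrams are counted inside each sentence, never across sentence boundaries.
bigrams = Counter(pair for sent in sentences for pair in zip(sent, sent[1:]))

# How often each word appears as the first element of a bigram.
history = Counter()
for (prev, _), count in bigrams.items():
    history[prev] += count

V = len(unigrams)           # vocabulary size, counting <s> and </s>
N = sum(unigrams.values())  # total tokens, counting <s> and </s>


def p_add_one(word, prev):
    """Add-one (Laplace) smoothed bigram estimate of P(word | prev)."""
    return (bigrams[(prev, word)] + 1) / (history[prev] + V)


def p_interp(word, prev, lam_bigram=0.5, lam_unigram=0.5):
    """Linear interpolation of ML bigram and ML unigram estimates."""
    p_bi = bigrams[(prev, word)] / history[prev] if history[prev] else 0.0
    p_uni = unigrams[word] / N
    return lam_bigram * p_bi + lam_unigram * p_uni


def sentence_prob(words, prob=p_interp):
    """Probability of a sentence given as a list of words without markers."""
    toks = ["<s>", *words, "</s>"]
    p = 1.0
    for prev, word in zip(toks, toks[1:]):
        p *= prob(word, prev)
    return p


def perplexity(words, prob=p_interp):
    """Perplexity of one sentence; </s> is predicted, <s> is not."""
    n = len(words) + 1
    return sentence_prob(words, prob) ** (-1 / n)


print(p_add_one("Sam", "am"))  # (1 + 1) / (2 + 7) = 2/9, about 0.2222
print(p_interp("Sam", "am"))   # 0.5 * 1/2 + 0.5 * 5/27 = 37/108, about 0.3426
```

For part (c), compare sentence_prob(["I", "am", "Sam"]) with sentence_prob(["Sam", "I", "am"]); for part (d), pass the test sentence to perplexity as a list of words. Counting V or the unigram total differently (for example, excluding <s>) changes the numbers, so check which convention your course expects.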