Question: Problem 2 (25 Points): Suppose you are given the following data:

he likes to sleep he likes to eat he likes to eat and sleep and eat the thing he likes to eat is mint the mint he likes to eat is pink

(a) (10 Points) Compute the probability that would be assigned to the string "<s> to sleep and eat </s>" by a bigram language model with no smoothing. Don't consider the start and end sentence symbols; just use the unigram probability of the first word to start.

(b) (15 Points) Compute the probability that would be assigned to the string "<s> to sleep and eat </s>" by a bigram language model with linear interpolation for smoothing (with the bigram weight λ1 set to 0.8 and the unigram weight λ2 set to 0.2). Since the unigram model does not need to estimate P(<s>), ignore the start token when estimating the unigram model.
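For readers who want to sanity-check their arithmetic, below is a rough Python sketch of how the counts and probabilities could be computed mechanically. It is not a verified solution, and it bakes in two assumptions: the training data is split into the five sentences listed in the script (the problem text runs them together on one line), and the <s>/</s> boundary tokens are ignored in both the bigram and unigram counts, following the wording of part (a); part (b) may instead intend the bigram model to include them, which would change the counts. Part (b) interpolates each bigram factor as λ1 · P_bigram(w | prev) + λ2 · P_unigram(w) with λ1 = 0.8 and λ2 = 0.2, and both parts start the product with the unigram probability of the first word, as the problem instructs.

```python
from collections import Counter

# Rough sketch, not a verified solution.
# ASSUMPTION: the training data is segmented into these five sentences;
# the problem text does not show the original line breaks.
sentences = [
    "he likes to sleep",
    "he likes to eat",
    "he likes to eat and sleep and eat",
    "the thing he likes to eat is mint",
    "the mint he likes to eat is pink",
]

# Count unigrams and within-sentence bigrams, ignoring <s>/</s> entirely
# (per the wording of part (a); see the note above about part (b)).
unigrams, bigrams = Counter(), Counter()
for sent in sentences:
    toks = sent.split()
    unigrams.update(toks)
    bigrams.update(zip(toks, toks[1:]))

total_tokens = sum(unigrams.values())

def p_uni(w):
    """Maximum-likelihood unigram probability P(w)."""
    return unigrams[w] / total_tokens

def p_bi(w, prev):
    """Maximum-likelihood bigram probability P(w | prev)."""
    return bigrams[(prev, w)] / unigrams[prev] if unigrams[prev] else 0.0

query = "to sleep and eat".split()

# (a) Unsmoothed: P(to) * P(sleep|to) * P(and|sleep) * P(eat|and)
p_a = p_uni(query[0])
for prev, w in zip(query, query[1:]):
    p_a *= p_bi(w, prev)

# (b) Linear interpolation: each bigram factor becomes
#     0.8 * P(w|prev) + 0.2 * P(w); the first factor stays P(to).
lam_bi, lam_uni = 0.8, 0.2
p_b = p_uni(query[0])
for prev, w in zip(query, query[1:]):
    p_b *= lam_bi * p_bi(w, prev) + lam_uni * p_uni(w)

print(f"(a) unsmoothed bigram probability: {p_a}")
print(f"(b) interpolated probability:      {p_b}")
```

Running the script prints both probabilities; editing the `sentences` list or adding the boundary tokens to the counts makes it easy to see how sensitive the answers are to those assumptions.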
