Question:
Input:
- lm: the language model you trained (the object you returned from the train_ngram_lm function)
- data: test data
- vocab: the vocabulary
- order: the order of the language model
Output:
- the perplexity of test data
Hint:
- If the history is not in the lm object, back off to the (n-1)-order history and check whether that is in lm. If no history can be found, just use 1/|V|, where |V| is the size of the vocabulary.
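For reference, the quantity to compute is the exponentiated negative average log-probability of the test tokens: perplexity(data) = exp(-(1/N) * Σ_i log p(w_i | h_i)), where N is the number of predicted tokens and h_i is the (n-1)-word history preceding w_i.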
def compute_perplexity(lm, data, vocab, order=3):
    # pad according to order
    order -= 1
    data = ['<s>'] * order + data
    for i in range(len(data) - order):
        h, w = ' '.join(data[i: i+order]), data[i+order]
        """
        IMPLEMENT ME!
        # if h not in lm, back off to the (n-1)-gram history and look up again
        """
        pass
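One way to fill in the body, as a minimal sketch: it assumes train_ngram_lm returns a dict mapping a history string to a dict of next-word probabilities, and that '<s>' is the padding token. Under those assumptions, the back-off in the hint amounts to repeatedly dropping the oldest word from the history until a known history (possibly the empty unigram history) is found:

import math

def compute_perplexity(lm, data, vocab, order=3):
    # pad according to order (assumes '<s>' is the padding token)
    order -= 1
    data = ['<s>'] * order + data
    log_prob, count = 0.0, 0
    for i in range(len(data) - order):
        h, w = ' '.join(data[i: i+order]), data[i+order]
        # if h not in lm, back off to the (n-1)-gram history and look up again
        while h not in lm and h:
            h = ' '.join(h.split()[1:])  # drop the oldest word from the history
        if h in lm and w in lm[h]:
            # assumes lm[h] maps next words to probabilities
            p = lm[h][w]
        else:
            # no usable history found: fall back to the uniform estimate 1/|V|
            p = 1.0 / len(vocab)
        log_prob += math.log(p)
        count += 1
    # perplexity = exp(-average log-probability of the predicted tokens)
    return math.exp(-log_prob / count)

With a definition like this in place, the driver loop below reports perplexity for orders 1 through 4.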
for o in [1, 2, 3, 4]:
    lm = train_ngram_lm(data['train'], order=o)
    print('order {} ppl {}'.format(o, compute_perplexity(lm, data['test'], vocab, order=o)))