Implement a Python function build_unigram_probs(unigrams, unigram_counts, total_count) which takes a list of all of the unique words in the book, a dictionary mapping unique unigrams to counts, and the total count of words in the book, and returns a new list of the probabilities of each word.

To do this, iterate through the indexes of the unigrams list. For each index, look up the count of the corresponding unigram in unigram_counts, then divide that count by total_count to get the probability that the word at that index in unigrams would be chosen at random from the book. Return the list of probabilities.

def test_build_unigram_probs():
    assert(build_unigram_probs(
        [ "hello", "world", "again"],
        { "hello" : 2, "world" : 2, "again" : 1 }, 5 ) ==
        [ 2/5, 2/5, 1/5 ])
    assert(build_unigram_probs(
        [ "hello", "and", "welcome", "to", "the", "program", ".", "we're", "happy", "have", "you"],
        { "hello" : 1, "and" : 1, "welcome" : 1, "to" : 2, "the" : 1, "program" : 1, "." : 2,
          "we're" : 1, "happy" : 1, "have" : 1, "you" : 1 }, 13) ==
        [ 1/13, 1/13, 1/13, 2/13, 1/13, 1/13, 2/13, 1/13, 1/13, 1/13, 1/13 ])
    assert(build_unigram_probs(
        [ "this", "is", "the", "song", "that", "never", "ends", "yes", "it",
          "goes", "on", "and", "my", "friends", "!", "some", "people", "started",
          "singing", ",", "not", "knowing", "what", "was", "now", "they", "keep",
          "forever", "just", "because", "." ],
        { "this" : 1, "is" : 1, "the" : 1, "song" : 1, "that" : 1, "never" : 1,
          "ends" : 1, "yes" : 1, "it" : 4, "goes" : 1, "on" : 3, "and" : 2,
          "my" : 1, "friends" : 1, "!" : 1, "some" : 1, "people" : 1,
          "started" : 1, "singing" : 2, "," : 2, "not" : 1, "knowing" : 1,
          "what" : 1, "was" : 1, "now" : 1, "they" : 1, "keep" : 1,
          "forever" : 1, "just" : 1, "because" : 1, "." : 3 }, 41) ==
        [ 1/41, 1/41, 1/41, 1/41, 1/41, 1/41, 1/41, 1/41, 4/41, 1/41, 3/41, 2/41,
          1/41, 1/41, 1/41, 1/41, 1/41, 1/41, 2/41, 2/41, 1/41, 1/41, 1/41, 1/41,
          1/41, 1/41, 1/41, 1/41, 1/41, 1/41, 3/41 ])
    print("... done!")
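
One possible implementation, offered as a minimal sketch of the steps described in the question rather than an official solution. It assumes every word in unigrams has an entry in unigram_counts and that total_count is greater than zero; neither assumption is checked.

```python
def build_unigram_probs(unigrams, unigram_counts, total_count):
    """Return the probability of each word in unigrams, in the same order."""
    probs = []
    # Iterate through the indexes of the unigram list, as the question asks.
    for i in range(len(unigrams)):
        word = unigrams[i]
        # Assumption: word is a key of unigram_counts (KeyError otherwise).
        count = unigram_counts[word]
        # Assumption: total_count > 0 (ZeroDivisionError otherwise).
        probs.append(count / total_count)
    return probs
```

Because each probability is computed with the same division the test uses (for example 2/5), the exact equality checks in test_build_unigram_probs should hold. Looping over indexes follows the wording of the question; iterating directly over the words would give the same list.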
