Question: PYTHON I am working on next problem. Consider the following sentences written in Klingon. For each sentence, the part of speech of each word has

PYTHON

I am working on next problem.

Consider the following sentences written in Klingon. For each sentence, the part of speech of each word has been given (for ease of translation, some prefixes/suffixes have been treated as words), along with a translation. Using these training sentences, were going tobuild a Hidden Markov Model(HMM)to predict the part of speech of an unknown sentence using the Viterbi algorithm.

N PRO V N PRO

paDaq ghah taH terangan e

room (inside) he is human of

The human is in the room

V N V N

jachuqmeH rojHom neH terangan

in orderto parley truce want human

The enemy commander wants a truce in order to parley

N V N CONJ N V N

tera ngan qIp puq eg puq qIp terangan

human bit child and child bit child

The child bit the human, and the human bit the child

Step 1: Creating the Emission probability table(emission.javaor emission.py)Create a Emission probability table by computingthe frequencies of each part of speech in thetable below for all POS tags. Well use a smoothing factor of 0.1 (as discussed in class) to make sure that no event is impossible; add this number to all of your observations. Sample table valuesof two parts of speechhave been shown.Probability(word|tag) = Count(word,tag) / Count(tag)

and here is what I got for this part:

words1 = "paDaq ghah taH terangan e".replace("","'").split()

tags1 = "N PRO V N PRO".split()

words2 = "jachuqmeH rojHom neH terangan".replace("","'").split()

tags2 = "V N V N".split()

words3 = "terangan qIp puq eg puq qIp terangan".replace("","'").split()

tags3 = "N V N CONJ N V N".split()

train = []

train.append(zip(words1, tags1))

train.append(zip(words2, tags2))

train.append(zip(words3, tags3))

from collections import defaultdict

new_dict = defaultdict(list)

#print(new_dict)

for sent in train:

for word, tag in sent:

new_dict[word].append(tag)

#print(new_dict)

for word, tags in sorted(new_dict.items()):

row = []

row.append(word)

#print(row)

for tag in ["N", "V", "CONJ", "PRO"]:

row.append(tags.count(tag)+0.1)

There is next step: Creating the Transition probability table (transition.py) Generate a transition probability table by calculating the transition frequencies from one POS tag to another. Now, for each part of speech, total the number of times it transitioned to each other part of speech. Again, use a smoothing factor of 0.1. After youve done this, compute the start and transition probabilities. Sample table values of transition for two parts of speech have been shown.

Probability(tagi|tagi-1) = Count(tagi-1, tagi) / Count(tagi-1)

May someone help me here?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

PYTHON Consider the following sentences written in Klingon. For each sentence, the part of speech of each word has been given (for ease of translation, some prefixes/suffixes have been treated as...

Please scan the SEC Plain English that I've attached. Please visit to this link.http://www.sec.gov/Archives/edgar/data/320193/000119312513416534/d590790d10k.htm#toc590790_9 Please read pages 25...

Discuss Semantics and the challenges they are in English. 2 Language Structure and Use Learning Outcomes After reading this chapter, you should be able to ... Explain how language contributes to...

Rev.Confirming Pages C H A P T E R 7 Planning, Composing, and Revising Chapter Outline The Ways Good Writers Write Activities in the Composing Process Using Your Time Effectively Brainstorming,...

Read Classroom Glimpse. Discuss stress, rhythm, pitch, and intonation based on the tale in the classroom 2 Language Structure and Use Learning Outcomes After reading this chapter, you should be able...

You may practice teaching and learning tactics. Create a list you may use in class, others, and as a solo instructor. 2 Language Structure and Use Learning Outcomes After reading this chapter, you...

Providing Quality School-Based Learning and Support Services 239 Chapter 6 Language and literacy support Your core task The core task of almost all TAs is to support students language and literacy...

Create four language guidelines: two for Spanish and two for English, each with a descriptive and required component. This chapter is a brief introduction to modern linguistics and to topics that...

What are the biggest ah-ha! moments from Oracy Development? 6 English-Language Oracy Development Learning Outcomes After reading this chapter, you should be able to ... . Describe the basics of...

If 12.39 g of Urea (CN_(2)OH_(4)) are produced when 8.87 g of Ammonia react completely with Carbon dioxide gas, what is the percent yield for this reaction? 2NH_(3)(g) + CO_(2)(g) CN_(2)OH_(4)(s) +...

For each of the 32 National Football League teams, the numbers of points scored and allowed during the 2012 season are shown below. Assuming these are sample data, answer the following questions. You...

The diffusion coefficients for silver in copper are given at two temperatures (a) Determine the values of D0 and Qd. (b) What is the magnitude of D at 875C? D (m/s 5.5 x 1016 1.3 x 10 13 2 650 900

The Canliss Milling Company purchased machinery on January 2 , 2 0 2 2 , for $ 8 4 0 , 0 0 0 . A five - year life was estimated and no residual value was anticipated. Canliss decided to use the...

Enzymes are biological catalysts. Enzymes are proteins that are found inside cells to increase the rate of chemical reactions within each cell. Enzymes are denatured (destroyed) by various...

What is the purpose of a Position Control Table? What relationships to other Compensation Tables would be important?

What Data Elements are usually found in the Job Family Table, and what is the relationship of the Job Family Table to the Occupation Table?

What is the relationship between the Internal Staff Compensation Target Table and the Internal Staff Compensation Data Table?