Question: I have written a CKY parser, but the unit tests are failing with the wrong probabilities. I have tried the "best_prob" variable as both a float and a decimal, but I am still getting the incorrect probabilities.

These are the instructions:

Create a function named cky_parsing that applies the CKY algorithm to parse sentences using a given Probabilistic Context-Free Grammar (PCFG). The function should handle unknown words by substituting them with a placeholder token and should employ the Viterbi parsing algorithm. Additionally, it should account for the grammar's productions to identify known words.

1. Function Definition:

Name the function cky_parsing. It should accept two parameters: sentences (a list of sentences to be parsed) and grammar (the PCFG used for parsing).

2. Preprocessing:

Within the function, construct a set of known words present in the given productions (generate the 'production' set from the grammar passed). This set is used to determine if words in the sentences are covered by the grammar.

3. Viterbi Parser Setup:

Initialize a Viterbi parser using the provided PCFG.

4. Sentence Processing:

Iterate over each sentence in sentences:

Tokenize the sentence.

Replace any word not found in the set of known words with the placeholder token.

Parse the sentence using the Viterbi parser.

Select the parse with the highest probability, or handle cases where no valid parse is found (check the parse_all method of the ViterbiParser object from the nltk.parse library).

5. Return Value:

The function should return a list of tuples, one tuple per sentence processed. Each tuple should contain:

The index of the sentence within the input list.

The original sentence.

The best parse tree found, or an appropriate value indicating the grammatical structure of sentences.
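For reference, the probabilistic CKY (Viterbi) recurrence that the instructions rely on can be sketched in plain Python, without NLTK. This is only an illustration under stated assumptions: the grammar is in Chomsky normal form, and the names `viterbi_cky`, `lexical`, and `binary`, as well as the toy grammar, are hypothetical and not part of the assignment.

```python
from collections import defaultdict


def viterbi_cky(tokens, lexical, binary, start="S"):
    """Return (probability, tree) of the best parse, or (0.0, None) if none exists.

    lexical: {(A, word): prob} for rules A -> word
    binary:  {(A, B, C): prob} for rules A -> B C
    """
    n = len(tokens)
    # best[(i, j)][A] = (prob, tree) for the best A spanning tokens[i:j]
    best = defaultdict(dict)

    # Width-1 spans: apply lexical rules.
    for i, word in enumerate(tokens):
        for (lhs, w), p in lexical.items():
            if w == word and p > best[(i, i + 1)].get(lhs, (0.0, None))[0]:
                best[(i, i + 1)][lhs] = (p, (lhs, word))

    # Wider spans: combine two adjacent sub-spans with a binary rule.
    for width in range(2, n + 1):
        for i in range(0, n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for (lhs, b, c), p in binary.items():
                    if b in best[(i, k)] and c in best[(k, j)]:
                        pb, tb = best[(i, k)][b]
                        pc, tc = best[(k, j)][c]
                        cand = p * pb * pc
                        if cand > best[(i, j)].get(lhs, (0.0, None))[0]:
                            best[(i, j)][lhs] = (cand, (lhs, tb, tc))

    return best[(0, n)].get(start, (0.0, None))


# Hypothetical toy grammar in Chomsky normal form.
lexical = {("NP", "she"): 0.5, ("NP", "fish"): 0.5, ("V", "eats"): 1.0}
binary = {("S", "NP", "VP"): 1.0, ("VP", "V", "NP"): 1.0}

prob, tree = viterbi_cky(["she", "eats", "fish"], lexical, binary)
print(prob, tree)
```

The probability of a tree is the product of the probabilities of the rules it uses, so the result depends only on the grammar's rule probabilities, not on how the running maximum is stored.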

```
import nltk
from nltk.grammar import PCFG
from nltk.parse import ViterbiParser


# Create a function named cky_parsing
def cky_parsing(sentences, grammar):
    # Construct a set of known words present in the given productions
    known_words = set()
    for production in grammar.productions():
        for rhs in production.rhs():
            if isinstance(rhs, str):  # terminals (words) are plain strings
                known_words.add(rhs)

    # Initialize a Viterbi parser using the provided PCFG
    viterbi_parser = ViterbiParser(grammar)

    iter_results = []
    # Iterate over each sentence in sentences
    for index, sentence in enumerate(sentences):
        # Tokenize the sentence
        tokens = nltk.word_tokenize(sentence)

        # Replace any word not found in the set of known words with the placeholder token
        process_tokens = [token if token in known_words else '' for token in tokens]

        # Parse the sentence using the Viterbi parser
        parse_trees = viterbi_parser.parse_all(process_tokens)

        # Select the parse with the highest probability, or handle cases where no valid parse is found
        best_parse = None
        best_prob = 0.0  # tree.prob() returns a float
        for tree in parse_trees:
            prob = tree.prob()
            if prob > best_prob:
                best_prob = prob
                best_parse = tree

        # Append the best parse, or None if no valid parse was found
        iter_results.append((index, sentence, best_parse))

    return iter_results
```
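The unknown-word step can be looked at in isolation. The sketch below is plain Python and makes one loud assumption: the placeholder is written as `<UNK>`, which is a hypothetical choice for illustration, since the actual token is missing from the post. As far as I know, NLTK's ViterbiParser checks that every input token is covered by the grammar and raises a ValueError otherwise, so whatever placeholder is used would need to appear as a terminal in the grammar itself.

```python
UNK = "<UNK>"  # hypothetical placeholder; use whatever token the grammar actually defines


def replace_unknown(tokens, known_words, unk=UNK):
    """Return a copy of tokens with out-of-vocabulary words replaced by unk."""
    return [tok if tok in known_words else unk for tok in tokens]


# Hypothetical vocabulary that includes the placeholder itself.
known = {"the", "dog", "barks", UNK}
replaced = replace_unknown(["the", "cat", "barks"], known)
print(replaced)  # ['the', '<UNK>', 'barks']
```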

The unit tests are failing with:

Actual output:   NNP -> 'Beach' [0.016129]
Expected output: NNP -> 'Beach' [0.00735294]

Actual output:   NP -> WDT NNS [0.0196078]
Expected output: NP -> WDT NNS [0.00144928]

Actual output:   VBG -> 'leaving' [0.166667]
Expected output: VBG -> 'leaving' [0.264706]

Actual output:   FRAG* -> PP PP [0.142857]
Expected output: FRAG* -> PP PP [0.416667]

Actual output:   RBS -> 'least' [1.0]
Expected output: RBS -> 'least' [1.0]
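The mismatched values are attached to individual productions, so they appear to be rule probabilities of the grammar rather than tree probabilities, and nothing in the posted function changes them: best_prob only compares tree.prob() values. If the PCFG was induced by relative-frequency estimation (for example with nltk.induce_pcfg, which is an assumption about the surrounding assignment), each probability is count(lhs -> rhs) / count(lhs), so different production counts give different values. A minimal sketch, with hypothetical counts chosen only so the ratios match two numbers from the test output:

```python
from collections import Counter


def relative_frequency(rules):
    """Estimate P(lhs -> rhs) = count(lhs -> rhs) / count(lhs) from observed rules."""
    rule_counts = Counter(rules)
    lhs_counts = Counter(lhs for lhs, _ in rules)
    return {rule: count / lhs_counts[rule[0]] for rule, count in rule_counts.items()}


# Hypothetical rules; the real counts behind the unit tests are not known.
target = ("VBG", ("leaving",))
other = ("VBG", ("going",))

sample_a = [target] * 1 + [other] * 5    # 1 of 6 VBG rules
sample_b = [target] * 9 + [other] * 25   # 9 of 34 VBG rules

p_a = relative_frequency(sample_a)[target]
p_b = relative_frequency(sample_b)[target]
print(f"{p_a:.6f} {p_b:.6f}")  # 0.166667 0.264706
```

Under that assumption, it may be worth checking which trees the grammar was built from and how they were preprocessed before induction, since switching best_prob between float and Decimal would not affect these numbers.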
