Question: Please provide Python code to answer the following questions, please don't use handwriting code, thank you # use Penn Treebank P.O.S for POS Tagging import

Please provide Python code to answer the following questions, please don't use handwriting code, thank you

# use Penn Treebank P.O.S for POS Tagging import nltk from nltk import word_tokenize from nltk.corpus import brown

# Question 20: use given words like BTWords (Brown corpus tagged words) or sample text # 20.a: Print the first 5 words from an alphabetically sorted list of the distinct words tagged as MD. (MD == Modal) BTWords = nltk.corpus.brown.tagged_words() ModalWords = [w for (w, t) in BTWords if t == 'MD'] sorted(set(ModalWords))[:5]

# 20.c: Identify three-word prepositional phrases of the form IN + DT + NN (e.g., in the lab) using raw_sent sentence. # Note: Textbook says DET, but current Brown corpus uses DT instead. # need to tokenize first, POS Tag and trigram. # see an example: to in trigrams for tagged_sent in brown.tagged_sents(): for (w1,t1), (w2,t2), (w3,t3) in nltk.trigrams(tagged_sent): if (t1.startswith('V') and t2 == 'TO' and t3.startswith('V')): print(w1, w2, w3)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!