Question: In PYTHON PLEASE Write a program that takes as input a text file of arbitrary length and count the occurrence of words in that text.

In PYTHON PLEASE

Write a program that takes as input a text file of arbitrary length and count the occurrence of words in that text. Present the counts and words, sorted with the highest count first, and remove punctuation and change all words to lowercase. Also remove stopwords. Follow these steps:

implement the starter program. You need to comment the program. Read the code. Figure out how it works.

a file of stopwords for you to use called stopWordsEng.txt. read the file and omit these words from the word counts made in the program.

Starter program:

def byFreq(pair): return pair[1] def main(): print("This program analyzes word frequency in a file") print("and prints a report on the n most frequent words. ") # get the sequence of words from the file fname = input("File to analyze: ") text = open(fname,'r').read() text = text.lower() for ch in '!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~': text = text.replace(ch, ' ') words = text.split() #  counts = {} for w in words: counts[w] = counts.get(w,0) + 1 #  n = int(input("Output analysis of how many words? ")) # Print your output here. The command below is a placeholder # to get you started, but it is not the final solution! print(counts) main()

stopwords file:

it is a file with words like i me my myself we our ours ourselves you your yours yourself yourselves he him his himself she her hers herself it its itself they them their theirs themselves what which who whom, etc

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!