Question: Then, submit a Jupyter notebook with Python code which does the following and answers the questions. Read the text into a text corpus. How many

Then, submit a Jupyter notebook with Python code which does the following and answers the questions.
Read the text into a text corpus.
How many words are in the corpus? --1 point
What are the five most common words? --3 points
Now, read the file and create a token list. Then, cleanse your token list by removing punctuation, converting everything to lower case, and removing stop words.
Repeat Step 4, indicating what are the five most common words after you have gone through the cleansing steps. Note: A token list is not the same thing as a corpus. You may find it helpful Python's "collections" package, which has a method called "most_common". --6 points

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!