Question:
In this assignment we are going to work with a larger collection of tweets (10,000) that are available here: http://rasinsrv07.cstcis.cti.depaul.edu/CSC455/Assignment5.txt
A. Using Python, identify the top-5 most frequent terms (words separated by spaces) that are at least 4 characters long (i.e., ignore articles such as "a" or "the" and any other short terms) in the text of the tweets. It is up to you whether to use the contents of the loaded database (reading tweets from SQLite, which contains fewer tweets) or the contents of the original Assignment5.txt file (reading tweets directly from the file).
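One possible way to approach the counting in part A is sketched below. It is a minimal illustration, not the assignment's reference solution. It assumes the tweet texts have already been extracted into a list of strings (from SQLite or from Assignment5.txt, whose exact layout is not described here), and the names `top_terms` and `sample_tweets` are hypothetical. Terms are counted case-sensitively and punctuation is left attached, since the question only asks for words separated by spaces.

```python
from collections import Counter


def top_terms(texts, n=5, min_len=4):
    """Return the n most frequent terms that have at least min_len characters.

    texts is any iterable of tweet text strings (hypothetical input; how the
    tweets are loaded is left to the reader).
    """
    counts = Counter()
    for text in texts:
        # str.split() with no arguments splits on any run of whitespace,
        # which also covers the single-space case named in the question.
        for term in text.split():
            if len(term) >= min_len:
                counts[term] += 1
    # most_common(n) returns a list of (term, count) pairs, highest count first.
    return counts.most_common(n)


# Made-up sample data, used only to show the call.
sample_tweets = [
    "data science with python",
    "python makes data work easy",
    "the data is in",
]

for term, count in top_terms(sample_tweets):
    print(term, count)
```

With the sample data, "data" (3 occurrences) and "python" (2 occurrences) come first, while short terms such as "the", "is", and "in" are skipped. For the real assignment, the same function could be called on the list of tweet texts read from whichever source you choose.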
