Question: This will be in a format that is structured with lines of data representing one type of unit, for example, one tweet for Twitter or
This will be in a format that is structured with lines of data representing one type of unit, for example, one tweet for Twitter or one post from Facebook. Your program will contain the data as lists of json structures, which are just python dictionaries and lists.
The program will do some processing to collect data from some of the fields that will answer one or more questions, as described below, and write a file with the data suitable for answering each question. Remember that some fields may be optional or have null values, so you may need to test for those conditions. Graphing is definitely optional.
Questions:
Types of questions:
process one collection of data and summarize information from a number of fields. This is similar to the example programs for Twitter hashtags or Facebook counts but must access different and more fields than in those examples.
process one collection of data and separate it into different categories and give some summary statistics on those categories. For example, bin the tweets by day or by hour and report on the number of tweets per day or hour.
process two or more collections of data and compare some summary data about the two collections. For example, collect Twitter user timelines from different political candidates and compare the number of retweets of their tweets.
You may use the programs twitter_lang.py as an example, but you must use different fields. You may also use twitter_hashtags.py or facebook_counts.py, but in these programs you must add a part to write a file. In all case, you must change the comments to reflect your individual understanding of the program. If you only do one question, then it must be more complex than these simple examples; otherwise, you may choose additional questions. Data: You may collect data from Twitter, Facebook or some other URL that returns JSON data. (If you want to use another format, such as XML, please ask.) If you collect your own, please collect at least several hundred data items, if possible. You could use the data that you are collecting for your project (even if it is not from Twitter or Facebook).
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
