Question: in PYTHON : use JustDoIt dataset. write a python program to count number of occurancs for each #tag and time it re-wrire the program into
in PYTHON :
use JustDoIt dataset.
write a python program to count number of occurancs for each #tag and time it
re-wrire the program into mapper.py and reducer.py and run it on hadoop. Analyse how hadoop has processed the program and compare the perfromance with your paython code.
re-write the code using mrjob and sort the output

Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
