Question: 2- (3 points) [MapReduce Framework] Assume that you have 10 billion text lines representing the amount of different medicines taken by patients of a hospital.
2- (3 points) [MapReduce Framework] Assume that you have 10 billion text lines representing the amount of different medicines taken by patients of a hospital. Below you see a snapshot of the data.

| Medicine | Amount |
| --- | --- |
| Acetaminophen | 4 grams |
| HUMULIN | 3 grams |
| magnesium oxide | 4 grams |
| HUMULIN | 1 grams |
| Acetaminophen | 1 grams |
| Acetaminophen | 5 grams |
| Acetaminophen | 2 grams |

You have been assigned to calculate the total dosage of each medicine.

a. How would you do it using a single computer, given that you cannot store all the text lines in memory? Describe your approach. (Tip: you need to read the data line by line incrementally.)

b. Assume that you have access to a cluster of computers and should design a MapReduce job to calculate the dosages. How would you do it? (Note: you should describe what the key and value are, and all the details of the map and reduce procedures needed to calculate the total dosage.)
Step by Step Solution
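One possible approach to both parts can be sketched in Python. This is a minimal illustration, not a reference answer. It assumes each input line has the form `<medicine name> <amount> grams` (for example `magnesium oxide 4 grams`), and that the snapshot pairs each medicine with the amount in the same position. The helper names `parse_line`, `total_dosage_streaming`, `map_fn`, `reduce_fn`, and `run_mapreduce` are hypothetical, and `run_mapreduce` is only a local stand-in for what a real framework would do across a cluster.

```python
from collections import defaultdict
from itertools import groupby
from typing import Iterable, Iterator


def parse_line(line: str) -> tuple[str, float]:
    """Split an assumed '<medicine> <amount> grams' line into (name, grams)."""
    # rsplit from the right so multi-word names like 'magnesium oxide' survive.
    name, amount, _unit = line.strip().rsplit(maxsplit=2)
    return name, float(amount)


# Part a: single machine, single pass. Only one running total per distinct
# medicine is kept in memory, never the 10 billion lines themselves.
def total_dosage_streaming(lines: Iterable[str]) -> dict[str, float]:
    totals: dict[str, float] = defaultdict(float)
    for line in lines:  # e.g. an open file object, which yields lines lazily
        if not line.strip():
            continue  # skip blank lines
        name, grams = parse_line(line)
        totals[name] += grams
    return dict(totals)


# Part b: the map step emits (key = medicine name, value = grams) per line.
def map_fn(line: str) -> Iterator[tuple[str, float]]:
    yield parse_line(line)


# The reduce step receives one key with all of its values and sums them.
def reduce_fn(key: str, values: Iterable[float]) -> tuple[str, float]:
    return key, sum(values)


def run_mapreduce(lines: Iterable[str]) -> dict[str, float]:
    """Local simulation only: map, then sort/group by key, then reduce."""
    pairs = [kv for line in lines for kv in map_fn(line)]
    pairs.sort(key=lambda kv: kv[0])  # stands in for the shuffle/sort phase
    return dict(
        reduce_fn(key, (value for _, value in group))
        for key, group in groupby(pairs, key=lambda kv: kv[0])
    )


# Snapshot from the question, under the pairing assumption stated above.
sample = [
    "Acetaminophen 4 grams",
    "HUMULIN 3 grams",
    "magnesium oxide 4 grams",
    "HUMULIN 1 grams",
    "Acetaminophen 1 grams",
    "Acetaminophen 5 grams",
    "Acetaminophen 2 grams",
]

if __name__ == "__main__":
    print(total_dosage_streaming(sample))
    print(run_mapreduce(sample))
```

For part a, passing an open file object instead of `sample` keeps memory use proportional to the number of distinct medicines rather than the number of lines. For part b, because addition is associative and commutative, the same summing logic could plausibly also serve as a combiner on each mapper to reduce the data shuffled across the network.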
