Question: a) Explain the Map-reduce paradigm with respect to distributed and parallel processing. Use the following diagram below to illustrate all the four (4) phases
a) Explain the Map-reduce paradigm with respect to distributed and parallel processing. Use the following diagram below to illustrate all the four (4) phases [10 marks] List(K2, V2) K2,List(V2) K1,V1 Bear, (1,1) Bear, 2 Deer Bear River Deer, 1 Bear, 1 List(K3,V3) River, 1 Car, (1,1,1) Car, 3 Bear, 2 Dear Bear River Car, 1 Car, 3 Car Car River Car Car River Car, 1 Deer, 2 Deer Car Bear River, 1 Deer, (1,1) River, 2 Deer, 2 Deer Car Bear Deer, 1 Car, 1 Bear, 1 River, (1,1) River, 2 b) Using an application area of your choice explain how the Hadoop ecosystem can be used to handle big data. Be sure to specify the type of data, the source, the storage and processing techniques. [10 marks] c) Explain the concept of Federated databases and the coupling techniques. [5 marks]
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
