Question: NOTE: These are big data concepts course Chapter: hadoop MapReduce NOTE: Please solve the questions clearly and step by step and leave the question to
1- Draw the workflow of MapReduce that count the number of students registered in each course. [2 marks] Program Input: Courses registered by each student Student ID =1, courses =3, database, programming, data structure Student ID =2, courses =2, python, operating system Student ID =3, courses =3, python, database, programming Student ID =4, courses =4, data structure, programming, math, machine learning Program output: Total number of students in each course 2. Consider A file containing the following text: [1 mark] When working with serialized data, you must specify the characters. that separate values within and between the key-value pairs Convert this file into one split of four records as key-value pairs. 3- Consider the size of job file is 4GB Compute the following: [2 marks] a. Number of blocks if the size of each block is 128MB. b. The size of each file if the job file broken into 20000 files. c. The number of map tasks if each map contains 1000 file. d. The number of tasks if each task contains 5 maps. e. The total time of job if the task executes in 3 seconds
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
