Question: Write a MapReduce program to find the maximum salary of an employee for each department along with the employee Id.
Data:
EMPNO,EMPNAME,JOB,MANAGERID,HIREDATE,SAL,COMMISSION,DEPTNO
7369,SMITH,CLERK,7902,13-06-13,800,0,20
7499,ALLEN,SALESMAN,7698,15-08-14,1600,300,30
7521,WARD,SALESMAN,7698,26-03-14,1250,500,30
7566,JONES,MANAGER,7839,31-10-14,2975,,20
7698,BLAKE,MANAGER,7839,11-06-14,2850,,30
7782,CLARK,MANAGER,7839,14-05-14,2450,,10
7788,SCOTT,ANALYST,7566,05-03-14,3000,,20
7839,KING,PRESIDENT,,09-06-14,5000,0,10
7844,TURNER,SALESMAN,7698,04-06-14,1500,0,30
7876,ADAMS,CLERK,7788,04-06-14,1100,,20
7900,JAMES,CLERK,7698,23-06-14,950,,30
7934,MILLER,CLERK,7782,21-01-14,1300,,10
7902,FORD,ANALYST,7566,05-12-14,3000,,20
7654,MARTIN,SALESMAN,7698,05-12-14,1250,1400,30
Dataset Description:
| COLUMN NAME | NULL? | TYPE |
| EMPNO | NOT NULL | NUMBER(4) |
| EMPNAME | NOT NULL | STRING |
| JOB | NOT NULL | STRING |
| MANAGERID | | INT |
| HIREDATE | | DATE |
| SAL | | DOUBLE |
| COMMISSION | | DOUBLE |
| DEPTNO | NOT NULL | INT |
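The nullable columns matter when a row is split on commas, because empty fields such as a missing MANAGERID or COMMISSION must not shift the remaining columns. The following is a minimal sketch of one way to parse a row in plain Java. The class name `EmpRecord` and its field names are hypothetical, and HIREDATE is kept as raw text because the question does not state the date format.

```java
import java.util.OptionalDouble;
import java.util.OptionalInt;

// Hypothetical holder for one data row; fields follow the column table above.
class EmpRecord {
    final int empNo;
    final String empName;
    final String job;
    final OptionalInt managerId;      // nullable column
    final String hireDate;            // raw text; date format is not specified
    final OptionalDouble sal;         // nullable column
    final OptionalDouble commission;  // nullable column
    final int deptNo;

    private EmpRecord(String[] f) {
        empNo = Integer.parseInt(f[0]);
        empName = f[1];
        job = f[2];
        managerId = f[3].isEmpty()
                ? OptionalInt.empty()
                : OptionalInt.of(Integer.parseInt(f[3]));
        hireDate = f[4];
        sal = parseOptionalDouble(f[5]);
        commission = parseOptionalDouble(f[6]);
        deptNo = Integer.parseInt(f[7]);
    }

    private static OptionalDouble parseOptionalDouble(String s) {
        return s.isEmpty()
                ? OptionalDouble.empty()
                : OptionalDouble.of(Double.parseDouble(s));
    }

    // Parses one comma-separated data row (not the header row).
    static EmpRecord parse(String line) {
        // A limit of -1 makes split keep empty fields instead of dropping them.
        String[] f = line.trim().split(",", -1);
        if (f.length != 8) {
            throw new IllegalArgumentException("expected 8 fields, got " + f.length);
        }
        return new EmpRecord(f);
    }
}
```

For example, `EmpRecord.parse("7839,KING,PRESIDENT,,09-06-14,5000,0,10")` yields an empty `managerId` while `deptNo` is still read from the eighth field.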
Output: It should include the columns Dept, max(salary), and EmpNo.
Constraints:
- Use a single MapReduce job to complete the task
- Also, skip the header row while reading the file
Expected Solution: You need to paste the MR code, Hadoop commands & path of the final jar that is used to achieve this output.
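As a starting point, the map and reduce logic for a single pass can be sketched in plain Java without any Hadoop classes. This is an illustration of the logic only, not the Hadoop job, commands, or jar that the question asks for. The class and method names are hypothetical. Two assumptions are made: the header is detected by its leading `EMPNO` text, and ties on salary go to the smallest EMPNO so the result does not depend on value order.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java simulation of one map + reduce pass; no Hadoop classes are used.
class MaxSalaryPerDept {

    // Map step: emit (DEPTNO, "SAL,EMPNO") for a data row, or null for rows to skip.
    static Map.Entry<Integer, String> map(String line) {
        String row = line.trim();
        if (row.isEmpty() || row.startsWith("EMPNO")) {
            return null; // blank line or header row
        }
        String[] f = row.split(",", -1); // -1 keeps empty fields
        if (f.length != 8 || f[5].isEmpty()) {
            return null; // malformed row or missing salary
        }
        return new AbstractMap.SimpleEntry<>(Integer.parseInt(f[7]), f[5] + "," + f[0]);
    }

    // Reduce step: scan one department's values and keep the highest salary.
    // Assumption: ties go to the smallest EMPNO.
    static String reduce(int dept, List<String> values) {
        double maxSal = Double.NEGATIVE_INFINITY;
        int maxEmp = -1;
        for (String v : values) {
            String[] p = v.split(",");
            double sal = Double.parseDouble(p[0]);
            int emp = Integer.parseInt(p[1]);
            if (sal > maxSal || (sal == maxSal && emp < maxEmp)) {
                maxSal = sal;
                maxEmp = emp;
            }
        }
        return dept + "," + maxSal + "," + maxEmp;
    }

    // Driver: group map output by key (a stand-in for the shuffle), then reduce each group.
    static List<String> run(List<String> lines) {
        Map<Integer, List<String>> groups = new TreeMap<>();
        for (String line : lines) {
            Map.Entry<Integer, String> kv = map(line);
            if (kv != null) {
                groups.computeIfAbsent(kv.getKey(), k -> new ArrayList<>()).add(kv.getValue());
            }
        }
        List<String> out = new ArrayList<>();
        groups.forEach((dept, vals) -> out.add(reduce(dept, vals)));
        return out;
    }
}
```

In an actual Hadoop job the body of `map` would sit in a `Mapper` and the body of `reduce` in a `Reducer`, with the framework doing the grouping that `run` imitates here. With the default text input format the mapper's key is the line's byte offset, so checking for offset 0 is another common way to skip the header.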
