Question: Consider that you are given a file containing the annotated form of the Mahabharata which runs into 4GB as a text file. The Mahabharata

Consider that you are given a file containing the annotated form of the Mahabharata which runs into 4GB as a

Consider that you are given a file containing the annotated form of the Mahabharata which runs into 4GB as a text file. The Mahabharata is broken into 18 chapters of parvas and each parva had many shlokas. Different shlokas were given to different scholars for translation to English and each shloka and its translation were entered into a web page that accepted data in the following format and stored it on a text file Parva Number, Shloka Number, Translation And hence were in random order in the file "Mahabharata.txt" which was stored on HDFS. Design a MapReduce program to sort all the shlokas and their translations in the right order both based on the Parva and the shloka within it. You need not write entire map-reduce code, but need to identify the intermediate keys of the mapper and corresponding values. The keys at the reducer and the corresponding output values. Do you need anything else to make this work? How many mappers do you expect to start if you are using Hadoop v2?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

To sort all the shlokas and their translations in the right order based on both the Parva number and the shloka number within it you can design a MapR... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Reviewing the balance sheet of PEDRO Sporting Goods Corporation, Tom discovered that the total liabilities amounted to $6 million, while the owner's equity was $2 million. What is the total assets of...

A ticket to the school dance is $6 and usually 250 students attend. The dance committee knows that for every $1 increase in the price of a ticket, 25 fewer students attend the dance. What ticket...

2. Consider that you are given a file containing the annotated form of the Ramayana which runs into 900MB as a text file. The Ramayana is broken into 7 major Kaandas (books) namely the Baala,...

Suppose that you are given an n n checkerboard and a checker. You must move the checker from the bottom edge of the board to the top edge of the board according to the following rule. At each step...

In the capital budgeting model in Figure 14.40, we supplied the NPV for each investment. Suppose instead that you are given only the streams of cash inflows from each investment shown in the file...

Solve the previous problem using the input data in the file P14_50.xlsx. In the capital budgeting model in Figure 14.40, we supplied the NPV for each investment. Suppose instead that you are given...

Programming Project 2 asks you, among other things, to write a program that creates a binary file of objects of the class Species. Write a program that reads from a file created by that program and...

Suppose that you are given n red and n blue water jugs, all of different shapes and sizes. All red jugs hold different amounts of water, as do the blue ones. Moreover, for every red jug, there is a...

Business ethics is perhaps one of the most personal and emotional areas in business decision making. Business ethics can be confusing and complex because a right or wrong answer often does not exist....

Consider that you are 35 years old and have just changed to a new job. You have $80,000 in the retirement plan from your former employer. You can roll that money into the retirement plan of the new...

Read the profile of the Cloverdale Mall (Spotlight on Retailing). What are the strengths and weaknesses of their location strategy?

The Medium Run AS-AD Model A. The US economy was sent into a recession by the external shock of the Covid pandemic. Using the AS-AD model, show and explain the impact and how the economy is supposed...

In performing an audit in accordance with Generally Accepted Government Auditing Standards ( the "Yellow Book" ) , the auditor: A . Accepts less responsibility in conducting fieldwork than is...

Examine the dashboard of IT expenditures in the federal government discussed in this chapter. What recommendations for changes in the budget can you find by examining these data?

The connecting rod AB of a certain internal-combustion engine weighs 1.2 lb with mass center at G and has a radius of gyration about G of 1.12 in. The piston and piston pin A together weigh 1.80 lb....

You are a big financial success and you want to purchase the Remlab Company for $35 billion. You have $5 billion in cash, but need to borrow the remaining $30 billion from your friendly banker. You...

The X-ray department of a hospital served 934 patients last month and uses 185 square metres of space. The ultrasound department of the hospital served 1800 patients last month and uses 275 square...

What is the formula for calculating Return on Investment (ROI) in project management?Question 29 options:Cost Time(Net Benefits / Costs)100Total Cost Project DurationBenefits Risks

Bev and Ken Hair have been married for 3 years. They live at 3567 River Street, Springfield, MO 63126. Ken is a full-time student at Southwest Missouri State University (SMSU) and Bev works as an...

Karim Depak received a Form 1099-B showing the following stock transactions and basis during 2012: None of the stock is qualified small business stock. Calculate Karim's net capital gain or loss...

Carol Harris, Ph.D, CPA, is a single taxpayer and she lives at 674 Yankee Street, Durham, NC 27409. Her Social Security number is 793-52-4335. Carol is an Associate Professor of Accounting at a local...

The Wilson Company uses a great deal of water in the process of making industrial milling equipment. To comply with the federal clean water laws, it has a water purification system that all...

The Inland Empire Food Store Company has stated in its advertising that the average shopper will save more than $5.00 per week by shopping at Inland stores. A consumer group has decided to test this...

The Haines Lumber Company makes plywood for the furniture industry. One product it makes is 3/4-inch oak veneer panels. It is very important that the panels conform to specifications. One...