Consider that you are given a file containing the annotated form of the Mahabharata which runs...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Consider that you are given a file containing the annotated form of the Mahabharata which runs into 4GB as a text file. The Mahabharata is broken into 18 chapters of parvas and each parva had many shlokas. Different shlokas were given to different scholars for translation to English and each shloka and its translation were entered into a web page that accepted data in the following format and stored it on a text file Parva Number, Shloka Number, Translation And hence were in random order in the file "Mahabharata.txt" which was stored on HDFS. Design a MapReduce program to sort all the shlokas and their translations in the right order both based on the Parva and the shloka within it. You need not write entire map-reduce code, but need to identify the intermediate keys of the mapper and corresponding values. The keys at the reducer and the corresponding output values. Do you need anything else to make this work? How many mappers do you expect to start if you are using Hadoop v2? Consider that you are given a file containing the annotated form of the Mahabharata which runs into 4GB as a text file. The Mahabharata is broken into 18 chapters of parvas and each parva had many shlokas. Different shlokas were given to different scholars for translation to English and each shloka and its translation were entered into a web page that accepted data in the following format and stored it on a text file Parva Number, Shloka Number, Translation And hence were in random order in the file "Mahabharata.txt" which was stored on HDFS. Design a MapReduce program to sort all the shlokas and their translations in the right order both based on the Parva and the shloka within it. You need not write entire map-reduce code, but need to identify the intermediate keys of the mapper and corresponding values. The keys at the reducer and the corresponding output values. Do you need anything else to make this work? How many mappers do you expect to start if you are using Hadoop v2?
Expert Answer:
Answer rating: 100% (QA)
To sort all the shlokas and their translations in the right order based on both the Parva number and the shloka number within it you can design a MapR... View the full answer
Related Book For
Income Tax Fundamentals 2013
ISBN: 9781285586618
31st Edition
Authors: Gerald E. Whittenburg, Martha Altus Buller, Steven L Gill
Posted Date:
Students also viewed these programming questions
-
Reviewing the balance sheet of PEDRO Sporting Goods Corporation, Tom discovered that the total liabilities amounted to $6 million, while the owner's equity was $2 million. What is the total assets of...
-
A ticket to the school dance is $6 and usually 250 students attend. The dance committee knows that for every $1 increase in the price of a ticket, 25 fewer students attend the dance. What ticket...
-
2. Consider that you are given a file containing the annotated form of the Ramayana which runs into 900MB as a text file. The Ramayana is broken into 7 major Kaandas (books) namely the Baala,...
-
Maria is opposed to the idea of same-sex marriage. In a recent conversation in the school cafeteria, Maria argues, "If homosexuals are allowed to marry, then why not allow polygamy or other kinds of...
-
Locate the centroid yc of the shaded area. Given: a = 4 m b = 4 m (
-
Why might life insurance and retirement plans be appropriate choices for charitable gifts in the overall estate plan of a donor?
-
Let \(X\) denote the time between detections of a particle with a Geiger counter and assume that \(X\) has an exponential distribution with \(E(X)=1.4\) minutes. The probability that we detect a...
-
One cause of the downtime in Problem 3 was traced to a specific piece of computer hardware. Management believes that switching to a different hardware component will result in the following...
-
What happened to the USA's economic growth? What is the main contributing factor to this change? Discuss in a couple of sentences things that are important for the change in the USA's economic...
-
Based on the two tables and the attributes below, write SQL commands for each question to retrieve the data from the database. tblSales InvoiceNumber InvoiceDate CustomerNumber Sales OrderNumber...
-
1. Post adjusting entries to the T-accounts. 2. Prepare an income statement, a balance sheet, and a statement of cash flows using the final ending T-account balances as of January 31. Assets: Cash...
-
Huang Company presented the following data (yen in thousands). Instructions Compute earnings per share. Net income Preference shares: 50,000 shares outstanding, 100 par, 8% cumulative, not...
-
Consider a sample taken from the population of all taxi-in times for all flights that land in Los Angeles. Identify the symbols used for the sample mean and the population mean.
-
A portion of the statement of income and retained earnings of Pierson Inc. for the current year follows. During the year, Pierson Inc. had a loss from discontinued operations of \($1\),340,000 after...
-
At January 1, 2015, Cameron Companys outstanding shares included the following. Net income for 2015 was R\($2\),830,000. No cash dividends were declared or paid during 2015. On February 15, 2016,...
-
Amy Dyken, controller at Fitzgerald Pharmaceutical Industries, a public company, is currently preparing the calculation for basic and diluted earnings per share and the related disclosure for...
-
Draw a full Lewis structure for CH3CHNH2, showing all bonds, lone pairs, and atoms (4 points). Use it to answer the mulitple choice questions (2 points each) 0-6 Can it hydrogen bond to itself? Can...
-
In the operation of an automated production line with storage buffers, what does it mean if a buffer is nearly always empty or nearly always full?
-
Bev and Ken Hair have been married for 3 years. They live at 3567 River Street, Springfield, MO 63126. Ken is a full-time student at Southwest Missouri State University (SMSU) and Bev works as an...
-
Karim Depak received a Form 1099-B showing the following stock transactions and basis during 2012: None of the stock is qualified small business stock. Calculate Karim's net capital gain or loss...
-
Carol Harris, Ph.D, CPA, is a single taxpayer and she lives at 674 Yankee Street, Durham, NC 27409. Her Social Security number is 793-52-4335. Carol is an Associate Professor of Accounting at a local...
-
Some people argue that the government should not intervene in the case of a market failure because the government itself is inefficient and will simply create new problems to replace the ones it is...
-
Consider each of the following issues and discuss whether you support Theory X, Theory Y, neither theory, or some combination of them. Issue Theory X Theory Y Whether a person is healthy or sick...
-
Looking at Medicaids traditional eligibility rules, you will notice numerous value/policy judgmentspregnant women and children are favored over childless adults, the medically needy are favored over...
Study smarter with the SolutionInn App