Question: 2. Consider that you are given a file containing the annotated form of the Ramayana which runs into 900MB as a text file. The

improve the performance? c) One of the mappers is progressing slowly. How does the Hadoop YARN framework

2. Consider that you are given a file containing the annotated form of the Ramayana which runs into 900MB as a text file. The Ramayana is broken into 7 major Kaandas (books) namely the Baala, Ayodhya, Aranya, Kishkindha, Sundara, Yuddha, Uttara kaandas. Unfortunately the sentences of this text file are not in order and are jumbled up. However, each sentence of the text is stored in the format , (assume there are no commas in the sentence). For example, a short section is of the form Yuddha, Hanuman set off to bring back the Sanjeevani plant You have been asked to process this file using Map Reduce using Hadoop v2. Answer the following questions providing justifications. Credit will be awarded only if the justification is right. a) Write MR pseudo code to identify the number of sentences in each kaanda? Identify the intermediate keys and final keys. b) How many mappers and reducers would be used for processing this? Will a combiner help to improve the performance? c) One of the mappers is progressing slowly. How does the Hadoop YARN framework respond to this?

Step by Step Solution

★★★★★

3.35 Rating (158 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

The students question seems to involve processing the text file of the Ramayana which is divided into seven books or Kaandas using the MapReduce parad... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

A ticket to the school dance is $6 and usually 250 students attend. The dance committee knows that for every $1 increase in the price of a ticket, 25 fewer students attend the dance. What ticket...

Reviewing the balance sheet of PEDRO Sporting Goods Corporation, Tom discovered that the total liabilities amounted to $6 million, while the owner's equity was $2 million. What is the total assets of...

Consider that you are given a file containing the annotated form of the Mahabharata which runs into 4GB as a text file. The Mahabharata is broken into 18 chapters of parvas and each parva had many...

3) a) Find ;) basis of Null space 1) basis of column space of A where 3 .5 4 A= -3 3 2 8 5 - 2 - 2 -2 iii) verfiy rank - nullity theorem.

consider you have been given a gile containing the annotated form of the mahabaratha which runs into 4 GB as a text file. The mahabaratha is broken down to 1 8 chapters of parvas and each parva has...

INTRODUCTION ABOUT SETU THE MAKING OF KAMALA PRE-PRODUCTION BUDGET STATEMENT FROM QUESTION 1: PLEASE HELP FILL IN THE GREEN SECTIONS IN THIS QUESTION 2. PLEASE EXPLAIN ANSWERS. It was a typical...

Please read chapter 6 and answer the questions and see the ( guide to answer number 3) For each case study, you will view the material as the student's teacher, read the information provided and...

Please read chapter 6 and answer the questions and see the ( guide to answer number 3) 1. Decide what assessment you would like to do to provide you with more information about the student, 2....

VIEW THE STEP-BY-STEP SOLUTION TO: Title: ABC Appliance Inherent Risk and Control Design Assessment Prepare Inherent Risks and Control Design Assessment. This course project/case... I.Title: ABC...

Identify and describe several key activities required for managing cultural diversity, and to what degree do HR professionals believe these activities are beneficial in maintaining a competitive...

Identify which control activity is violated in each of the following situations. 1. Once a month, the sales department sends unnumbered sales invoices to the accounting department to be recorded....

When delivering a business message, you should Blank _ _ _ _ _ _ . Multiple choice question. present it in the same way to all listeners adapt your message to the audience assume the audience has...

Marissa intends to make contributions to a Tax Free Savings Account (TFSA) such that the account will accumulate to $150,000 after 20 years. What end-of-quarter contributions must be made if the TFSA...

To provide a starting point for gauging a companys relative valuation, analysts often look at a companys price-to-earnings (P/E) ratio. Returning to the COMPANY OVERVIEW page, you can see XOMs...

On January 1, 2021, Labtech Circuits borrowed $100,000 from First Bank by issuing a three-year, 8% note, payable on December 31, 2023. Labtech wanted to hedge the risk that general interest rates...

A ships roll can be stabilized with a control system. A voltage applied to the fins actuators creates a roll torque that is applied to the ship. The ship, in response to the roll torque, yields a...

Corporate Social Responsibility. Methamphetamine (meth) is an addictive drug made chiefly in small toxic labs (STLs) in homes, tents, barns, and hotel rooms. The manufacturing process is dangerous,...

The concurring partner agreed to a censure from the SEC with regards to the KPMG audit of Xerox's 1997-2000 financial statements as a result of:(Check all that apply.)Multiple select question.Failure...

List and describe the contents of the system specification.

Find a questionnaire on the Web that has been created to capture customer information. Describe the purpose of the survey, the way questions are worded, and how the questions have been organized. How...

What is the most popular kind of database today? Provide three examples of products that are based on this technology.

The following table shows a breakdown of the 113th U.S. Congress by party affiliation. a. A member of Congress is selected at random. What is the probability of selecting a Republican? b. Given that...

What do you see as being the primary challenges to introducing a project management philosophy in most organizations? That is, why is it difficult to shift to a project-based approach in many...

1) Suppose you were a project manager for Disney. Based on the information in this case, what critical success metrics do you think the company uses when designing a new ride; that is, how would you...