Question: Please help with this Explore the Weather Dataset with MapReduce jobs I will thumbs up!!! 1. Goto the book website http:/hadoopbook.com/ 2. Click on Code

Please help with this Explore the Weather Dataset with MapReduce jobs

I will thumbs up!!!

Please help with this Explore the Weather Dataset with MapReduce jobs I will thumbs up!!! 1. Goto the book website http:/hadoopbook.com/ 2. Click on Code and Data" link on t You could go ahead just download

1. Goto the book website http:/hadoopbook.com/ 2. Click on Code and Data" link on t You could go ahead just download l h codes for the book for your references. Not mandatory 3. Click on A sample of the NCDC weather dataset that is used throughout the book, can be found at 4. Click 1902.gz to download the dataset sample 5. gunzip the file 6. Copy the file to Hadoop cluster 7. Then copy the file to Hadoop HDFS file system 8. Run the awk code max temperature.sh (given on the Books' website or in textbook) Goto National Climatic Data Center (NCDC http://www.ncdc.noaa.gov) to explore datasets there, see if you can find the dataset that the text book was talking about. --if you need to copy the weather data to hadoop instance, this is how to do it, notice the -rp option copies the entire directory $ scp -i keyfile -rp weather ubuntu@54.68.190.87 weather or if you are running on EMR instances (user is hadoop) $ scp -i keyfile -rp weather ubuntu@54.68.190.87 weather 9. create 3 Java files. (Refer to Chp02 of the textbook) MaxTemperatureMapper.java, MapTemperatureRedcer.java, MaxTemperature.java Try to type in the code instead of copy and paste, or write your own codes After compile the code, export to a Jar file, or use mvn package, upload the jar file to a cloud master node $ scp-i keyfile ch02.jar ubuntu@5468.190.87:ch02.jar Switch to master node terminal Load the copy of 1902 weather data to HDFS cd weather $ hadoop fs -put 1902 /user/ubuntu/ch02_1902.txt Run your java MapReduce program $ hadoop jar ch02.jar MaxTemperature/user/ubuntu/ch02_1902.txt ch02_1902output Change the number of reducer from your java program. Go back to your MaxTemperature.java file, add this line job.setNumReduceTasks (3) 1. Goto the book website http:/hadoopbook.com/ 2. Click on Code and Data" link on t You could go ahead just download l h codes for the book for your references. Not mandatory 3. Click on A sample of the NCDC weather dataset that is used throughout the book, can be found at 4. Click 1902.gz to download the dataset sample 5. gunzip the file 6. Copy the file to Hadoop cluster 7. Then copy the file to Hadoop HDFS file system 8. Run the awk code max temperature.sh (given on the Books' website or in textbook) Goto National Climatic Data Center (NCDC http://www.ncdc.noaa.gov) to explore datasets there, see if you can find the dataset that the text book was talking about. --if you need to copy the weather data to hadoop instance, this is how to do it, notice the -rp option copies the entire directory $ scp -i keyfile -rp weather ubuntu@54.68.190.87 weather or if you are running on EMR instances (user is hadoop) $ scp -i keyfile -rp weather ubuntu@54.68.190.87 weather 9. create 3 Java files. (Refer to Chp02 of the textbook) MaxTemperatureMapper.java, MapTemperatureRedcer.java, MaxTemperature.java Try to type in the code instead of copy and paste, or write your own codes After compile the code, export to a Jar file, or use mvn package, upload the jar file to a cloud master node $ scp-i keyfile ch02.jar ubuntu@5468.190.87:ch02.jar Switch to master node terminal Load the copy of 1902 weather data to HDFS cd weather $ hadoop fs -put 1902 /user/ubuntu/ch02_1902.txt Run your java MapReduce program $ hadoop jar ch02.jar MaxTemperature/user/ubuntu/ch02_1902.txt ch02_1902output Change the number of reducer from your java program. Go back to your MaxTemperature.java file, add this line job.setNumReduceTasks (3)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Question 1 Step 1 You work for Thunderduck Custom Tables Inc. This is the first month of operations. The company designs and manufactures specialty tables. Each table is specially customized for the...

it is an assignment for managerial accounting i want a full help from you to get all the data which is essential in making my assignment please help me and explore the answer for briefly with...

please help! Conceptual Overview: Explore the value of fixed-interest coupon bonds of different terms. This graph shows the value of 10% coupon bonds of different terms across differing market...

Please help !! All directions have been provided !! Thank you, god bless you !! All pages please !! :) Introduction If you have ever experienced financial hardship or grew up with financial hardship,...

please help me with this assignment, I will really appreciate your help View the 66-minute Consuming Kids movie (Please read the directions below before viewing the film). Introduction Consuming Kids...

please help I need this asap Traversing a Maze In order to traverse the maze, your program needs to detect either a dead end condition ( i.e. user facing wall ) or an unpursued path (i.e. name is...

Please help summarize the article. Then, respond to the main questions, "Why does this matter in the teaching and learning of mathematics?" and "How does this article inform math teaching practice?...

how to virtualize this network, and how would I calculate the recourses required? (Number of ESXi hosts, RAM, CPU, and storage) Quantity Device 3 Server Server Server Server Server Server 2 2 1 1 2...

The following table shows the items of assets, liabilities, cash inflows, and cash outflows for Ho in September. Rent $6500 Monthly take-home salary $21850 Spending for food $3450 Cash in checking...

An inwestor inwested $ 2 , 0 0 0 six years ago at 4 , 5 percent interest. She spends all of her interest earnings immediately, so she only recelves interest on her initial $ 2 , 0 0 0 investment. As...

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

What are Measures in OLAP Cubes?

How do OLAP Databases provide for Drilling Down into data?

How are OLAP Cubes different from Production Relational Databases?