
2. Your Assignment

This programming assignment covers sorting with Hadoop and Spark on multiple nodes. You must use a Chameleon node with Bare Metal Provisioning (https://www.chameleoncloud.org). You must deploy Ubuntu Linux 22.04 on "compute-haswell" nodes at the IIT sites. Once you create a lease (up to 7 days are allowed), start your physical node, and Linux boots, you will find yourself with a physical node with 24 CPU cores, 48 hardware threads, 128GB of memory, and a 250GB SSD. You will install your favorite virtualization tools (e.g. VirtualBox, LXD/KVM, QEMU) and use them to deploy VMs of the following sizes: tiny.instance (4 cores, 8GB RAM, 20GB disk), small.instance (4 cores, 8GB RAM, 45GB disk), and large.instance (16 cores, 32GB RAM, 180GB disk). One possible way to create these VMs is sketched after this section. This assignment is broken down into several parts, as outlined below:

Hadoop File System and Hadoop Install: Download, install, configure, and start HDFS (part of Hadoop, https://hadoop.apache.org) on a virtual cluster with 1 large.instance + 1 tiny.instance, and then again on a virtual cluster with 4 small.instances + 1 tiny.instance. You must set replication to 2 (instead of the default 3), or you won't have enough storage capacity to conduct your experiments on the 24GB dataset (see the replication sketch below).

Datasets: Once HDFS is operational, you must generate your datasets with gensort (http://www.ordinal.com/gensort.html); you will create 4 workloads: data-3GB, data-6GB, data-12GB, and data-24GB (see the gensort sketch below). You may not have enough room to store them all and run your compute workloads, so make sure to clean up after each run. Remember that you will typically need 6X the storage, as you have the original input data (2x)
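One way to create the VM shapes above is with LXD virtual machines (one of the tools the assignment lists). The following is a minimal sketch only, assuming LXD is already installed and initialized on the bare-metal node; the instance names (hdfs-master, hdfs-worker1) and the ubuntu:22.04 image alias are my own choices, not part of the assignment.

    # Sketch: a large.instance (16 cores, 32GB RAM, 180GB disk) as an LXD VM.
    # Instance names and image alias are assumptions; adjust to your setup.
    lxc init ubuntu:22.04 hdfs-master --vm \
        -c limits.cpu=16 -c limits.memory=32GiB
    lxc config device override hdfs-master root size=180GiB
    lxc start hdfs-master

    # Same pattern for a tiny.instance (4 cores, 8GB RAM, 20GB disk).
    lxc init ubuntu:22.04 hdfs-worker1 --vm \
        -c limits.cpu=4 -c limits.memory=8GiB
    lxc config device override hdfs-worker1 root size=20GiB
    lxc start hdfs-worker1

Creating the instance with lxc init (rather than lxc launch) keeps it stopped, so the root-disk size can be overridden before the first boot.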
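For the replication requirement, HDFS reads the default block replication from the dfs.replication property in etc/hadoop/hdfs-site.xml. Below is a minimal sketch; the HADOOP_HOME path is an assumption, and the rest of the cluster configuration (core-site.xml, workers file, SSH setup) is omitted.

    # Sketch: set default block replication to 2, then bring HDFS up.
    # The install path is an assumption; point it at your actual Hadoop directory.
    HADOOP_HOME=~/hadoop
    cat > $HADOOP_HOME/etc/hadoop/hdfs-site.xml <<'EOF'
    <?xml version="1.0"?>
    <configuration>
      <property>
        <name>dfs.replication</name>
        <value>2</value>
      </property>
    </configuration>
    EOF

    # Format the namenode once, then start the HDFS daemons.
    $HADOOP_HOME/bin/hdfs namenode -format
    $HADOOP_HOME/sbin/start-dfs.sh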
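To build the workloads, note that gensort writes fixed 100-byte records, so the record count determines the file size. The sketch below assumes the gensort binary is on the PATH, treats 1 GB as 10^9 bytes (so data-3GB is 30,000,000 records), and uses an HDFS directory /inputs of my own choosing.

    # Sketch: generate the four workloads (100-byte records, GB taken as 10^9 bytes).
    # Record counts and HDFS paths are assumptions, not prescribed by the assignment.
    gensort -a  30000000 data-3GB     # ~3 GB
    gensort -a  60000000 data-6GB     # ~6 GB
    gensort -a 120000000 data-12GB    # ~12 GB
    gensort -a 240000000 data-24GB    # ~24 GB

    # Copy one input into HDFS (replication 2 applies as configured above),
    # then remove the local copy to conserve the limited disk space.
    hdfs dfs -mkdir -p /inputs
    hdfs dfs -put data-3GB /inputs/
    rm data-3GB

Generating and loading one workload at a time, and cleaning up between runs, helps stay within the VM disk sizes noted above.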
