Question: Big Data is a common buzz word thrown around in the world today. While not truly working with Big Data, this project will give you
Big Data is a common buzz word thrown around in the world today. While not truly working with Big
Data, this project will give you a chance to get an idea of how that data is used.
Big data involves the analysis of exceptionally large data sets that can potentially slow down
computational time; the amount of data to be analyzed in an efficient time frame to determine predictive
modelsbehaviors poses a problem to engineers from different fields. For example, the Human Genome
Project is a database of most human DNA sequences; this can be analyzed to understand the structure of
different viruses and mutations, which can help doctors formulate appropriate treatment plans for
patients. Predictive models have been developed for stock market trends to determine the optimal times
to buy and sell; the New England Complex Systems Institute has developed a model that can predict when
the stock market will crash based on how many stocks fluctuate together. Even water quality can be
predicted by big data analysis; this is really based on being able to anticipate a flood in a specific region
based on the amount of rainfall, storm durations, and drainage basin topography.
For this project, you will create a MATLAB program to analyze a large data set available on the internet.
Some examples of open access data sets are on Canvas. Your chosen data set should have at least
data points. Once you select your data set, you will need to pose several questions about your
data. You will then need create a program that allows you to answer those questions and others.
Project Requirements:
The code developed for the project should meet the following requirements:
Main Script File
o Should load the data set automatically, assuming that the data is in the same folder
o Should require the user to make selections about what the program will do either by
asking questions or presenting the user with a menu of options to choose from
o This script should continue to run until the user tells it to end
o The code must process ALL of the data in the set
UserDefined Functions at least
o Each function must have input and output arguments
o Each function must have at least one of the following structures:
IfElse Statements
Switch Case Statements
For Loop
While Loop
Nested Loop
o All functions have to be called at least once in the main script file
Data Analysis:
o The code must have options to do the at least of the following data manipulations
either in the main script or using a UDF:
Sort the data based on a chosen parameter
Produce descriptive statistics about the data mean standard deviation, etc.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
