Question:

Scenario
You have just started working as a data miner/analyst in the Analytics Unit of a company. The Head of the
Analytics Unit has brought you a dataset [a welcome present ;-)]. The dataset includes two files: a description of
the attributes and a table with the actual values of these attributes. The Head of the Analytics Unit has
mentioned that this is some sort of weather data that a potential client has provided for analysis, and
would like a report with some insights about the data that can be delivered to the client.
Your tasks include:
Understanding the specifics of the dataset;
Extracting information about each of the attributes, possible associations between them and other specifics
of the dataset.
The tasks in the assignment are specified below.
Tasks
A. Initial data exploration
A1. Identify the attribute type of each attribute in your dataset {Date, Location, MinTemp, MaxTemp, Rainfall,...}
(nominal, ordinal, interval or ratio). If it's not clear, you may need to justify why you chose the type.
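One way to keep the A1 decisions explicit is to record each attribute's measurement level next to the statistics that level supports. The sketch below is only an illustration: the levels assigned to Date, MinTemp, MaxTemp and Rainfall assume that temperatures are in degrees Celsius and rainfall is in millimetres, which the attribute description file would need to confirm, and the names `Level`, `ATTRIBUTE_LEVELS` and `meaningful_stats` are hypothetical.

```python
from enum import IntEnum


class Level(IntEnum):
    """Measurement levels, ordered from weakest to strongest."""
    NOMINAL = 1
    ORDINAL = 2
    INTERVAL = 3
    RATIO = 4


# Assumed typing for the attributes named in the task (to be checked against
# the attribute description file):
#   Date      - interval: differences between dates are meaningful, no true zero
#   Location  - nominal: labels with no inherent order
#   MinTemp / MaxTemp - interval, assuming degrees Celsius (zero is arbitrary)
#   Rainfall  - ratio, assuming millimetres (zero means no rain)
ATTRIBUTE_LEVELS = {
    "Date": Level.INTERVAL,
    "Location": Level.NOMINAL,
    "MinTemp": Level.INTERVAL,
    "MaxTemp": Level.INTERVAL,
    "Rainfall": Level.RATIO,
}

# Each level adds statistics on top of those allowed by the weaker levels.
STATS_BY_LEVEL = [
    (Level.NOMINAL, ["frequency", "mode"]),
    (Level.ORDINAL, ["median", "percentiles"]),
    (Level.INTERVAL, ["mean", "variance", "standard deviation"]),
    (Level.RATIO, ["coefficient of variation"]),
]


def meaningful_stats(level):
    """Return the summary statistics that make sense at the given level."""
    stats = []
    for min_level, names in STATS_BY_LEVEL:
        if level >= min_level:
            stats.extend(names)
    return stats


for name, level in ATTRIBUTE_LEVELS.items():
    print(f"{name}: {level.name.lower()} -> {', '.join(meaningful_stats(level))}")
```

A table like this also serves as the justification the task asks for when a type is not obvious, for example why a temperature in Celsius is treated as interval rather than ratio.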
A2. Identify the values of the summarising properties for the attributes, including frequency, location and spread
(e.g. value ranges of the attributes, frequency of values, distributions, medians, means, variances, percentiles, etc.;
that is, the statistics covered in the lectures and materials given). Note that not all of these summary
statistics will make sense for all the attribute types, so use your judgement! Where necessary, use proper
visualisations for the corresponding statistics.
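The summaries asked for in A2 can be computed with Python's standard `statistics` module, as in the minimal sketch below. The sample values are made up purely for illustration and are not taken from the client's dataset; the helper names `summarise_numeric` and `summarise_nominal` are hypothetical, and treating missing entries as `None` is an assumption about how the table is loaded.

```python
import statistics
from collections import Counter


def summarise_numeric(values):
    """Location and spread for an interval or ratio attribute."""
    data = [v for v in values if v is not None]  # drop missing entries
    q1, _, q3 = statistics.quantiles(data, n=4)  # quartile cut points
    return {
        "count": len(data),
        "min": min(data),
        "max": max(data),
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "variance": statistics.variance(data),  # sample variance
        "q1": q1,
        "q3": q3,
        "iqr": q3 - q1,
    }


def summarise_nominal(values):
    """Frequency table and mode for a nominal attribute."""
    counts = Counter(v for v in values if v is not None)
    return {"frequency": dict(counts), "mode": counts.most_common(1)[0][0]}


# Hypothetical sample values, for illustration only.
rainfall_mm = [0.0, 0.0, 1.2, None, 5.6, 0.2, 12.4, 0.0]
location = ["A", "B", "A", "C", "A", "B", "C", "A"]

print(summarise_numeric(rainfall_mm))
print(summarise_nominal(location))
```

Keeping the numeric and nominal summaries in separate functions reflects the note in the task: a mean or variance is not meaningful for an attribute such as Location, while a frequency table is.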
A3. Using KNIME or other tools, explore your dataset and identify any outliers, clusters of similar instances,
"interesting" attributes and specific values of those attributes. Note that you may need to 'temporarily' recode
attributes from nominal to numeric or from numeric to nominal. The report should include the corresponding
snapshots from the tools and an explanation of what has been identified there.
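For readers working outside KNIME, the two ideas in A3 that translate most directly into code are flagging outliers and temporarily recoding a numeric attribute as nominal. The sketch below uses Tukey's 1.5 x IQR rule, which is one common outlier convention and not necessarily the one covered in the lectures; the bin thresholds in `recode_rainfall` and both function names are hypothetical choices made for illustration.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return the values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


def recode_rainfall(mm):
    """Temporarily recode a numeric rainfall value as a nominal label.

    The thresholds are illustrative assumptions, not part of the dataset.
    """
    if mm == 0:
        return "none"
    if mm < 5:
        return "light"
    return "heavy"


# Hypothetical values, for illustration only.
sample = [1, 2, 3, 4, 5, 100]
print(iqr_outliers(sample))
print([recode_rainfall(v) for v in [0.0, 2.0, 10.0]])
```

Any value flagged this way still needs a judgement call in the report: an extreme rainfall reading may be a genuine storm rather than a recording error.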
Present your findings in the assignment report.