Question: 031250 Introduction to Data Analytics, Assignment 2: Data exploration and preparation (KNIME guide)

031250 Introduction to Data Analytics
Assignment 2: Data exploration and preparation
KNIME guide
I have been given a dataset, but because I lack experience with KNIME, I do not know how to process my assigned dataset in KNIME to produce results that meet the requirements of this assignment.
A3. Using KNIME or other tools, explore your dataset and identify any outliers, clusters of similar instances, "interesting" attributes and specific values of those attributes. Note that you may need to 'temporarily' recode attributes to numeric or from numeric to nominal. The report includes the corresponding snapshots from the tools and an explanation of what has been identified there.
Present your findings in the assignment report.
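One common way to flag outliers in a numeric attribute for A3 is the interquartile-range (IQR) rule, the same rule that decides which points a box plot draws beyond its whiskers. The sketch below is only an illustration in plain Python: the function name `iqr_outliers`, the multiplier `k=1.5`, and the sample readings are assumptions, not part of the assignment or its dataset.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values lying outside the fences [Q1 - k*IQR, Q3 + k*IQR]."""
    # quantiles(..., n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Hypothetical rainfall readings in mm (not taken from the assignment dataset).
rainfall = [0.0, 0.2, 0.0, 1.4, 3.6, 0.8, 0.0, 2.2, 45.0, 1.0]
outliers = iqr_outliers(rainfall)
print(outliers)
```

A value flagged this way is a candidate for closer inspection, not automatically an error: heavy rainfall days, for example, are rare but genuine.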
B. Data preprocessing
Perform each of the following data preparation tasks (each task applies to the original data) using your choice of tool:
B1. Use the following binning techniques to smooth the values of the Rainfall attribute:
Equi-width binning
Equi-depth binning.
In the assignment report, for each of these techniques, you need to illustrate your steps. In your Excel workbook file place the results in separate columns in the corresponding spreadsheet. Use your judgement in choosing the appropriate number of bins - and justify this in the report.
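The two binning techniques in B1 can be sketched in plain Python as follows. Equi-width binning splits the value range into intervals of equal length, equi-depth binning puts roughly the same number of values in each bin, and smoothing replaces each value with the mean of its bin. The function names, the choice of three bins, and the sample values are assumptions made for illustration; the real Rainfall column and a justified bin count would take their place.

```python
import statistics

def equal_width_bins(values, n_bins):
    """Assign each value a bin index 0..n_bins-1 using intervals of equal width."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    if width == 0:
        return [0] * len(values)
    # The maximum sits exactly on the last edge, so clamp it into the final bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]

def equal_depth_bins(values, n_bins):
    """Assign bin indices so that each bin holds roughly the same number of values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        # Splitting purely by rank means tied values can land in different bins.
        bins[i] = rank * n_bins // len(values)
    return bins

def smooth_by_bin_mean(values, bins):
    """Replace every value with the mean of the bin it was assigned to."""
    means = {
        b: statistics.mean(v for v, vb in zip(values, bins) if vb == b)
        for b in set(bins)
    }
    return [means[b] for b in bins]

# Hypothetical values standing in for the Rainfall column.
values = [4, 8, 15, 21, 21, 24, 25, 28, 34]
width_bins = equal_width_bins(values, 3)
depth_bins = equal_depth_bins(values, 3)
smoothed_width = smooth_by_bin_mean(values, width_bins)
smoothed_depth = smooth_by_bin_mean(values, depth_bins)
print(smoothed_width)
print(smoothed_depth)
```

With skewed data such as rainfall, equi-width bins tend to leave most values in the first bin, while equi-depth bins stay balanced; that contrast is one possible basis for justifying the number of bins in the report.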
B2. Use the following techniques to normalise the attribute MaxTemp:
min-max normalization to transform the values onto the range [0.0-1.0].
z-score normalization to transform the values.
The assignment report provides an explanation of each of the applied techniques. In your Excel workbook file place the results in separate columns in the corresponding spreadsheet.
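Both normalisations in B2 are short formulas: min-max maps a value v to (v - min) / (max - min), and z-score maps it to (v - mean) / standard deviation. A minimal sketch in plain Python follows. The function names and sample temperatures are assumptions, and the z-score here uses the population standard deviation; a tool or spreadsheet may use the sample standard deviation instead, which gives slightly different values.

```python
import statistics

def min_max(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so the minimum maps to new_min and the maximum to new_max."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        raise ValueError("all values are identical; min-max scaling is undefined")
    return [new_min + (v - lo) / span * (new_max - new_min) for v in values]

def z_score(values):
    """Centre values on the mean and scale by the population standard deviation."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)  # assumption: population std; swap in stdev() for the sample version
    if std == 0:
        raise ValueError("all values are identical; z-score is undefined")
    return [(v - mean) / std for v in values]

# Hypothetical values standing in for the MaxTemp column.
max_temp = [20.0, 25.0, 30.0, 35.0, 40.0]
scaled = min_max(max_temp)
standardised = z_score(max_temp)
print(scaled)
print(standardised)
```

After z-score normalisation the values have mean 0 and standard deviation 1, which is a quick check to run on the result column.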
B3. Discretise the WindSpeed3pm attribute into the following categories: Slow Wind, Medium Wind, Fast Wind, and Very Fast Wind. Provide the frequency of each category in your dataset.
The assignment report explains each of the applied techniques. In your Excel workbook file place the results in a separate column in the corresponding spreadsheet.
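Discretisation for B3 amounts to choosing three cut points and labelling the four resulting intervals. The sketch below shows the mechanics in plain Python. The cut points of 15, 30 and 45 km/h are purely hypothetical, as are the sample speeds; the assignment does not specify thresholds, so the ones used in the report need their own justification (for example, quartiles of the column or a published wind scale).

```python
import bisect
from collections import Counter

# Hypothetical cut points in km/h; the assignment does not prescribe these.
CUTS = [15, 30, 45]
LABELS = ["Slow Wind", "Medium Wind", "Fast Wind", "Very Fast Wind"]

def wind_category(speed):
    """Map a speed to its label: <15 slow, [15, 30) medium, [30, 45) fast, >=45 very fast."""
    return LABELS[bisect.bisect_right(CUTS, speed)]

# Hypothetical values standing in for the WindSpeed3pm column.
wind_speed_3pm = [7, 13, 15, 22, 28, 31, 44, 46, 9, 61]
categories = [wind_category(s) for s in wind_speed_3pm]
frequency = Counter(categories)
print(frequency)
```

The `Counter` gives the frequency of each category that B3 asks for.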
B4. Binarise the WindDir9am variable [with values "0" or "1"].
The assignment report explains the applied binarisation technique. In your Excel workbook file place the results in separate columns in the corresponding spreadsheet.
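Because WindDir9am is a nominal attribute with several possible values, one way to binarise it is one-hot encoding: one new 0/1 column per distinct direction, with exactly one column set to 1 in each row. This is one reasonable reading of "separate columns" in B4, not the only one. The function name, the column-name prefix, and the sample directions below are assumptions, and missing values would need to be handled before encoding.

```python
def one_hot(values, prefix="WindDir9am"):
    """Return the sorted categories and, per row, a dict of 0/1 indicator columns."""
    categories = sorted(set(values))
    rows = [
        {f"{prefix}_{c}": int(v == c) for c in categories}
        for v in values
    ]
    return categories, rows

# Hypothetical values standing in for the WindDir9am column.
wind_dir_9am = ["N", "SE", "N", "W", "SE"]
categories, rows = one_hot(wind_dir_9am)
for row in rows:
    print(row)
```

Each row's indicator values sum to 1, which is a simple check that the encoding was applied correctly.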
