Question: 0 3 1 2 5 0 Introduction to Data Analytics, Assignment 2: Data exploration and preparation (KNIME guide)
I have been given a dataset, but because I lack experience with KNIME, I do not know what I should do with the dataset in KNIME to get results that meet the requirements of this assignment.
A Using KNIME or other tools, explore your dataset and identify any outliers, clusters of similar instances, "interesting" attributes and specific values of those attributes. Note that you may need to 'temporarily' recode attributes to numeric or from numeric to nominal. The report includes the corresponding snapshots from the tools and an explanation of what has been identified there.
Present your findings in the assignment report.
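One common way to flag candidate outliers in a numeric attribute is the interquartile range (IQR) rule, which is also what a box plot visualises. The following is a minimal Python sketch of that rule, offered as an alternative to a KNIME workflow. The function name `iqr_outliers`, the multiplier `k=1.5`, and the sample readings are assumptions for illustration; they do not come from the assignment.

```python
from statistics import quantiles

def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # quantiles(..., n=4) returns the three quartile cut points (Q1, median, Q3).
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Made-up readings, for illustration only (not taken from the assigned dataset).
sample = [1, 2, 3, 4, 5, 6, 7, 8, 100]
print(iqr_outliers(sample))
```

Values flagged this way are only candidates; whether they are genuine outliers or valid extreme readings still needs to be argued in the report.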
B Data preprocessing
Perform each of the following data preparation tasks using your choice of tool. Each task applies to the original data.
B Use the following binning techniques to smooth the values of the Rainfall attribute:
Equi-width binning
Equi-depth binning
In the assignment report, for each of these techniques, you need to illustrate your steps. In your Excel workbook file place the results in separate columns in the corresponding spreadsheet. Use your judgement in choosing the appropriate number of bins and justify this in the report.
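The two binning techniques can be sketched in plain Python as follows. This is an illustrative sketch, not the assignment's prescribed method: it assumes smoothing by bin means (smoothing by bin medians or boundaries are other options), and the function names and sample values are hypothetical.

```python
from statistics import mean

def equal_width_bins(values, k):
    """Assign each value to one of k bins of equal width; return bin indices."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    if width == 0:
        return [0] * len(values)
    # The maximum value would fall into bin k, so clamp it into the last bin.
    return [min(int((v - lo) / width), k - 1) for v in values]

def equal_depth_bins(values, k):
    """Assign values to k bins holding (nearly) the same number of values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * k // len(values)
    return bins

def smooth_by_bin_means(values, bins):
    """Replace every value with the mean of the bin it belongs to."""
    groups = {}
    for v, b in zip(values, bins):
        groups.setdefault(b, []).append(v)
    means = {b: mean(vs) for b, vs in groups.items()}
    return [means[b] for b in bins]

# Made-up values for illustration; replace with the Rainfall column.
data = [4, 8, 15, 21, 21, 24, 25, 28, 34]
print(smooth_by_bin_means(data, equal_width_bins(data, 3)))
print(smooth_by_bin_means(data, equal_depth_bins(data, 3)))
```

Note that this equi-depth sketch splits purely by rank, so tied values can end up in different bins. For a heavily skewed attribute with many identical values, that is worth mentioning when justifying the number of bins.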
B Use the following techniques to normalise the attribute MaxTemp:
min-max normalization to transform the values onto the range
z-score normalization to transform the values.
The assignment report provides an explanation of each of the applied techniques. In your Excel workbook file place the results in separate columns in the corresponding spreadsheet.
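As a rough illustration of the two normalisation techniques, here is a minimal Python sketch. The target range for min-max is not legible in the question as posted, so the default of 0 to 1 below is an assumption. The function names and the use of the population standard deviation (`pstdev`) are likewise choices made for this sketch; use `statistics.stdev` instead if the sample standard deviation is wanted.

```python
from statistics import mean, pstdev

def min_max(values, new_min=0.0, new_max=1.0):
    """Rescale values linearly so that min -> new_min and max -> new_max."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: no spread to rescale.
        return [new_min] * len(values)
    scale = (new_max - new_min) / (hi - lo)
    return [new_min + (v - lo) * scale for v in values]

def z_score(values):
    """Centre values on the mean and divide by the population std deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0] * len(values)
    return [(v - mu) / sigma for v in values]

# Made-up temperatures for illustration; replace with the MaxTemp column.
temps = [10, 20, 30]
print(min_max(temps))
print(z_score(temps))
```

Min-max preserves the shape of the distribution but is sensitive to extreme values, while z-scores express each value as a number of standard deviations from the mean.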
B Discretise the WindSpeedpm attribute into the following categories: Slow Wind, Medium Wind, Fast Wind, and Very Fast Wind. Provide the frequency of each category in your dataset.
The assignment report explains each of the applied techniques. In your Excel workbook file place the results in a separate column in the corresponding spreadsheet.
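Discretisation into named categories amounts to choosing cut points and mapping each value to the interval it falls into. The sketch below assumes three hypothetical thresholds (10, 20 and 30, in whatever unit the attribute uses). The assignment does not give these values, so they would need to be chosen and justified from the data, for example from quartiles or domain knowledge.

```python
from bisect import bisect_right
from collections import Counter

LABELS = ["Slow Wind", "Medium Wind", "Fast Wind", "Very Fast Wind"]
CUTS = [10, 20, 30]  # hypothetical thresholds, not given in the assignment

def wind_category(speed, cuts=CUTS, labels=LABELS):
    """Map a speed to its label; a value equal to a cut goes to the higher bin."""
    return labels[bisect_right(cuts, speed)]

def category_frequencies(speeds):
    """Count how many values fall into each category."""
    return Counter(wind_category(s) for s in speeds)

# Made-up speeds for illustration; replace with the wind speed column.
speeds = [4, 9, 10, 13, 19, 20, 28, 31, 46]
for label in LABELS:
    print(label, category_frequencies(speeds)[label])
```

Missing values are not handled here and would need to be filtered out or given their own category before counting.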
B Binarise the WindDiram variable with values or
The assignment report explains the applied binarisation technique. In your Excel workbook file place the results in separate columns in the corresponding spreadsheet.
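One standard way to binarise a nominal attribute is one-hot encoding, where each distinct category becomes its own indicator column. The sketch below assumes the indicator values are 0 and 1 (the intended values are missing from the question as posted) and uses a hypothetical column-name prefix `WindDir_`.

```python
def binarise(values, prefix="WindDir_"):
    """One-hot encode a nominal column into one 0/1 column per category."""
    categories = sorted(set(values))
    return {
        prefix + c: [1 if v == c else 0 for v in values]
        for c in categories
    }

# Made-up wind directions for illustration only.
directions = ["N", "SE", "N", "W"]
for column, flags in binarise(directions).items():
    print(column, flags)
```

Each row has exactly one indicator set to 1, which is why the results occupy several separate columns in the spreadsheet.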
