Question: 3. (10 Marks) (a) Suppose that the data for analysis includes the attribute age. The age values for the data tuples are (in increasing order):
3. (10 Marks) (a) Suppose that the data for analysis includes the attribute age. The age values for the data tuples are (in increasing order): 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.
i. Use smoothing by bin means and by bin boundaries to smooth these data, with a bin depth of 5. Illustrate your steps. ii. Comment on the effect of this technique for the given data.
(b) (10 Marks) Imagine that you need to analyze iFashion sales and customer data. Note that many tuples have no recorded value for several attributes such as customer income. What are the methods for filling in the missing values for this attribute?
(c) (5 Marks) What problems can occur during Data Integration? Describe in brief.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
