Question: Data preprocessing is essential for obtaining quality input data. The four methods discussed are data cleaning, data integration, data transformation, and data reduction.

a. Explain each of these methods in 2 or 3 paragraphs. (16 points)

b. Suppose we were to develop software to implement each of these techniques. Discuss how easy or difficult it would be to develop software for each one. Give a well-reasoned answer: if you had to write such software yourself, how easy or hard would it be? (7 points)

c. Once the software has been developed, we run the programs. For each of the four programs, explain how much CPU processing time it requires. I am not looking for an actual number; rather, I am looking for your analysis of how much CPU processing each technique needs. (7 points)
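
To make the four methods named in the question concrete, here is a minimal sketch in Python using only the standard library. Everything in it is an assumption made for illustration: the toy `customers` records, the `regions` lookup, and the helper names `clean`, `integrate`, `transform`, and `reduce_attributes` are hypothetical. Mean imputation, a key-based join, min-max normalisation, and attribute selection are each just one simple instance of their category, not the only way to do it.

```python
from statistics import mean

# Hypothetical toy records; None marks a missing value.
customers = [
    {"id": 1, "age": 34, "income": 52000.0},
    {"id": 2, "age": None, "income": 61000.0},
    {"id": 3, "age": 46, "income": 58000.0},
]
# Hypothetical second data source, keyed by the same id.
regions = {1: "north", 2: "south", 3: "north"}


def clean(rows):
    """Data cleaning: fill missing ages with the mean of the known ages."""
    known = [r["age"] for r in rows if r["age"] is not None]
    fill = mean(known)
    return [{**r, "age": fill if r["age"] is None else r["age"]} for r in rows]


def integrate(rows, region_by_id):
    """Data integration: join a second source on the shared id key."""
    return [{**r, "region": region_by_id.get(r["id"])} for r in rows]


def transform(rows, field):
    """Data transformation: min-max normalise one numeric field to [0, 1]."""
    lo = min(r[field] for r in rows)
    hi = max(r[field] for r in rows)
    span = (hi - lo) or 1.0  # avoid division by zero when all values are equal
    return [{**r, field: (r[field] - lo) / span} for r in rows]


def reduce_attributes(rows, keep):
    """Data reduction: keep only the selected attributes."""
    return [{k: r[k] for k in keep} for r in rows]


# Apply the four steps in sequence.
prepared = reduce_attributes(
    transform(integrate(clean(customers), regions), "income"),
    keep=("age", "income", "region"),
)
```

In this toy version each helper makes only a few passes over the rows. That observation applies to these simplified variants only; real cleaning, integration, and reduction techniques can be considerably more involved.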
