Question: There will always be anomalies in data that can create gaps in our analytics. We call these outliers, as they tend to fall well outside the normal distribution of the data. What steps can we take to smooth this data over when we see it? Should we simply delete the outlier, or are there other tactics we can take to normalize for such a huge dispersion? For example, in your Netflix data, if we showed that someone was years old, we can safely conclude this is an error. Should we get rid of that entry entirely, or are there ways to preserve the data?
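To make the options in the question concrete, here is a minimal Python sketch of three common ways to treat an impossible age: drop the whole record, keep the record but blank out the age, or keep the record and substitute the median of the valid ages. The field names (`user`, `age`, `hours_watched`), the plausibility bounds, and the sample rows are all assumptions made up for illustration. They do not come from the actual Netflix data, and which strategy is appropriate depends on the analysis.

```python
from statistics import median

# Hypothetical plausibility bounds for a viewer's age (an assumption).
MIN_AGE, MAX_AGE = 0, 120

STRATEGIES = ("drop", "null", "impute")


def is_valid_age(age):
    """True when the age is present and inside the assumed plausible range."""
    return age is not None and MIN_AGE <= age <= MAX_AGE


def clean_ages(records, strategy="impute"):
    """Return a cleaned copy of `records`; the input list is not modified.

    "drop"   removes any record whose age is implausible.
    "null"   keeps the record but sets its age to None.
    "impute" keeps the record and replaces the age with the median valid age.
    Kept records gain an `age_was_fixed` flag so the change stays auditable.
    """
    if strategy not in STRATEGIES:
        raise ValueError(f"unknown strategy: {strategy!r}")

    valid_ages = [r["age"] for r in records if is_valid_age(r["age"])]
    fill_value = median(valid_ages) if valid_ages else None

    cleaned = []
    for record in records:
        if is_valid_age(record["age"]):
            cleaned.append({**record, "age_was_fixed": False})
        elif strategy == "drop":
            continue  # the whole row is lost, including its other fields
        elif strategy == "null":
            cleaned.append({**record, "age": None, "age_was_fixed": True})
        else:  # "impute"
            cleaned.append({**record, "age": fill_value, "age_was_fixed": True})
    return cleaned


# Made-up sample rows, for illustration only.
sample = [
    {"user": "a", "age": 25, "hours_watched": 10},
    {"user": "b", "age": 31, "hours_watched": 4},
    {"user": "c", "age": 350, "hours_watched": 7},
    {"user": "d", "age": 40, "hours_watched": 2},
]

for name in STRATEGIES:
    print(name, clean_ages(sample, name))
```

Dropping is the simplest choice but discards the viewing hours along with the bad age. The other two strategies preserve the rest of the row, and the `age_was_fixed` flag lets later analysis exclude or down-weight the repaired values.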
