Outliers are data points that differ significantly from other observations in the dataset. They can skew and mislead the training of machine learning models, especially those sensitive to the scale of data, such as Linear Regression. They can sway the generality of the model, skewing predictions and increasing the standard deviation.
Related Concepts:
- Handling outliers in similar to Handling Missing Data
- Methods for Handling Outliers
- Anomaly Detection