What are they?

  • Outliers are data points that differ significantly from other observations in the dataset.
  • Outliers are not just points that are far from the mean or visually strange.

Their effects:

  • They can skew and mislead the training of machine learning models, especially those sensitive to the scale of data, such as Linear Regression.
  • They can sway the generality of the model, skewing predictions and increasing the standard deviation.

Related Concepts: