Data Selection Handling Missing Data Handling Outliers One-hot encoding Handling Different Distributions Follow-up questions: Deleting rows or filling them with the mean is not always best