Variability in linear models
Mathematically, SSE is a Loss function given by:
where is the number of observations, is the actual value of the dependent variable for observation , and is the predicted value based on the linear regression model.
The formula for the Regression Sum of Squares (SSR) in the context of linear regression is:
Where:
- is the predicted value of the dependent variable for observation based on the linear regression model.
- is the mean of the observed values of the dependent variable.
- is the total number of observations.
SSR measures the amount of variability in the dependent variable that is explained by the independent variables in the model. It reflects how well the regression model captures the relationship between the independent and dependent variables.
Total Sum of Squares (SST) represents the total variability in the dependent variable . The relationship between SST, SSR, and SSE is given by:
This equation reflects the decomposition of total variability into explained variability (SSR) and unexplained variability (SSE) due to errors.