Summary

Logistic Regression is a machine learning algorithm used for binary Classification tasks, where the goal is to predict one of two possible outcomes (e.g., “Yes” or “No”). It works by modeling the probability that an input belongs to a particular class using a logistic function. It creates a ==decision boundary (threshold) in the feature space and uses the concept of odds and log-odds to make predictions. It is suitable for cases where the classes can be separated by an approximately linear boundary in feature space==.

Logistic regression naturally outputs probabilities, but it can be converted into a classifier by setting a threshold. The most common threshold is 0.5, meaning that if the model predicts a probability greater than 0.5, the input is classified as the positive class (1), otherwise as the negative class (0).
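
For instance, given an array of predicted probabilities, the thresholding step is a one-liner. A minimal sketch (the probability values here are made up for illustration):

import numpy as np

# Hypothetical predicted probabilities from a logistic regression model
probs = np.array([0.12, 0.48, 0.51, 0.93])

# The 0.5 threshold: probabilities above it map to class 1, the rest to class 0
labels = (probs > 0.5).astype(int)  # [0, 0, 1, 1]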

Key Concepts of Logistic Regression:

  1. Logistic Function (Sigmoid Function):

    • Logistic regression models the probability that a given input belongs to a particular class using a logistic (sigmoid) function, which transforms any input into a value between 0 and 1. This output represents the probability of the input belonging to the positive class (usually class “1”).
    • The sigmoid function is defined as:

    $$\sigma(z) = \frac{1}{1 + e^{-z}}$$

    where $z = b_0 + b_1 x_1 + \dots + b_n x_n$ is the weighted sum of the input features (a small sketch follows this list).

  2. Decision Boundary:

    • Once the model is trained, it creates a decision boundary that separates the two classes. In 2D space, this boundary is a straight line, while in higher dimensions, it becomes a hyperplane.
    • Logistic regression uses the logistic function to determine the probability that a data point belongs to a certain class. If the probability is above a certain threshold (typically 0.5), the model predicts that the point belongs to the positive class; otherwise, it predicts the negative class.

    This concept is similar to the decision boundary in Support Vector Machines, where a hyperplane is used to separate classes.

  3. Binary Classification:

    • Logistic regression is primarily used for binary classification tasks, where the target variable has only two possible values (e.g., “0” and “1”). It can handle multiple independent variables (features), and based on the feature values, it assigns probabilities to the target classes.
    • Examples of binary classification tasks include predicting whether a tumor is malignant or benign (Breast Cancer dataset), or whether a passenger survived the Titanic disaster (Titanic dataset).
  4. No Residuals:

    • Unlike Linear Regression, logistic regression does not have standard residuals. Instead, the model evaluates performance by comparing predicted probabilities with actual class labels using metrics like accuracy, precision, recall, and the Confusion matrix.
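
As a small illustration of points 1 and 2, the following sketch (with made-up, not fitted, weights) computes the sigmoid of a weighted sum for one data point. Note that p = 0.5 corresponds exactly to z = 0, which is the equation of the decision boundary:

import numpy as np

def sigmoid(z):
    # Squashes any real-valued z into the interval (0, 1)
    return 1 / (1 + np.exp(-z))

# Made-up intercept and weights for two features (illustration only)
b0, b1, b2 = -1.0, 0.8, 0.5
x1, x2 = 1.2, 0.4                 # a single data point

z = b0 + b1 * x1 + b2 * x2        # weighted sum of the inputs
p = sigmoid(z)                    # probability of the positive class

# p > 0.5 exactly when z > 0, i.e. when the point lies on the
# positive side of the line b0 + b1*x1 + b2*x2 = 0
label = int(p > 0.5)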

Use Cases:

  • Logistic regression is widely used in real-world applications, especially in binary classification problems like:
    • Predicting whether a customer will buy a product (marketing predictions).
    • Medical diagnoses (e.g., predicting the likelihood of a patient developing a condition).
    • Financial analysis (e.g., predicting loan defaults).

Examples

Consider predicting whether a student will be admitted based on their SAT score and gender. The logistic regression model estimates the probability of admission; the target itself is binary (0 = not admitted, 1 = admitted).

A simple code example using the statsmodels library in Python:

import statsmodels.api as sm

# 'data' is assumed to be a DataFrame with an 'Admitted' (0/1) target,
# a numeric 'SAT' column, and a 'Gender' column already encoded as 0/1
y = data['Admitted']  # Target variable
x1 = data[['SAT', 'Gender']]  # Input features

x = sm.add_constant(x1)  # Add intercept term
reg_log = sm.Logit(y, x)  # Logit model
results_log = reg_log.fit()  # Fit via Maximum Likelihood Estimation

# Get the regression summary
results_log.summary()

The summary will show the coefficients of the model, including the intercept, their p-values, and overall fit statistics like the Pseudo R-squared, which gives an indication of model quality: it lies between 0 and 1, and values of roughly 0.2 to 0.4 are commonly considered a good fit.
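
Continuing the example, the fitted results object can also produce predicted probabilities, which the 0.5 threshold turns into class labels (a sketch, reusing the x design matrix from above):

import numpy as np

probs = results_log.predict(x)       # predicted probability for each row
preds = np.where(probs > 0.5, 1, 0)  # apply the 0.5 threshold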

Logistic regression is also a suitable choice for other binary classification tasks, such as the Breast Cancer dataset (predicting whether a tumor is malignant or benign) or the Titanic dataset (predicting survival, 0 or 1).

Odds and Log-Odds:

What is the difference between odds and probability?

In logistic regression, the model ==expresses the log-odds of an event as a linear function of the inputs, rather than modeling the probability directly==. The odds are defined as:

$$\text{odds} = \frac{p}{1 - p}$$

where $p$ is the probability of success. The log-odds (or logit function) is the natural logarithm of the odds:

$$\ln\left(\frac{p}{1 - p}\right) = b_0 + b_1 x_1$$

This transformation makes the relationship between the independent variables and the dependent variable linear, allowing logistic regression to estimate the parameters $b_0$ and $b_1$.
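
A quick worked example of the difference: a probability of 0.8 corresponds to odds of 4 (the event is four times as likely to happen as not) and log-odds of about 1.386. Probability lives in [0, 1], odds in [0, ∞), and log-odds in (−∞, ∞), which is what makes a linear model on the log-odds possible. In code:

import numpy as np

p = 0.8
odds = p / (1 - p)         # 4.0: four times as likely to happen as not
log_odds = np.log(odds)    # ~1.386, an unbounded real number

# The sigmoid inverts the logit, mapping log-odds back to probability
p_back = 1 / (1 + np.exp(-log_odds))  # 0.8 again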


Model Evaluation

In machine learning, after training the logistic regression model, we evaluate its performance using metrics such as the Confusion matrix or the Classification report, which provide insight into the model’s accuracy, precision, recall, and F1 score.

In Scikit-learn, logistic regression can be implemented as follows (sketched here on the Breast Cancer dataset mentioned above):

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, classification_report
from sklearn.model_selection import train_test_split

# Load the Breast Cancer dataset and hold out a test set
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

model = LogisticRegression(max_iter=10000)  # raise max_iter so the solver converges
model.fit(X_train, y_train)

y_pred = model.predict(X_test)

# Confusion matrix
print(confusion_matrix(y_test, y_pred))

# Classification report
print(classification_report(y_test, y_pred))

The confusion matrix and classification report help evaluate how well the model distinguishes between the two classes.
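
For reference, the headline metrics in the classification report can be computed by hand from the confusion matrix entries. A minimal sketch for the binary case, reusing y_test and y_pred from above:

from sklearn.metrics import confusion_matrix

# ravel() unpacks the 2x2 binary confusion matrix in this fixed order
tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()

accuracy = (tp + tn) / (tp + tn + fp + fn)
precision = tp / (tp + fp)   # of predicted positives, how many were correct
recall = tp / (tp + fn)      # of actual positives, how many were found
f1 = 2 * precision * recall / (precision + recall)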

Statsmodels Summary table

statsmodels provides this detailed summary table, unlike Scikit-learn.

Explanation of summary

In this example, the binary outcome is regressed on ‘duration’. The model used is a Logit regression (logistic in common lingo), and the method is:

  • Maximum Likelihood Estimation (MLE). The optimizer has clearly converged after fitting on 518 observations.
  • The Pseudo R-squared is 0.21, which is within the ‘acceptable region’ of roughly 0.2 to 0.4.
  • The ‘duration’ variable is significant and its coefficient is 0.0051.
  • The constant is also significant and equals -1.70 (p-value close to 0).
  • A high p-value suggests a variable should be removed from the model; drop such variables one by one, i.e. Feature Selection (a sketch follows this list).
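
A minimal sketch of that drop-one-at-a-time procedure (backward elimination by p-value) with statsmodels; the 0.05 cutoff and the y and x names are assumptions carried over from the admission example:

import statsmodels.api as sm

def backward_eliminate(y, x, threshold=0.05):
    # Repeatedly refit the model and drop the least significant
    # feature until every remaining p-value is below the threshold
    x = x.copy()
    while True:
        results = sm.Logit(y, x).fit(disp=0)
        pvalues = results.pvalues.drop('const')  # keep the intercept
        if pvalues.empty or pvalues.max() < threshold:
            return results
        x = x.drop(columns=pvalues.idxmax())

reduced = backward_eliminate(y, x)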
