Summary
Forward propagation is the process by which input data moves through a neural network, layer by layer, to produce an output. During this process, each layer applies its weights and biases to the data it receives and uses an activation function to transform the result before passing it on.
Mathematically, each layer transforms its input into an output through the equation a = f(Wx + b), where W represents the weights, b is the bias, and f is the activation function (e.g., ReLU, sigmoid). The output from one layer becomes the input to the next, and this continues until the final layer produces the predicted output.
This process does not involve learning; it only computes the prediction based on current weights.
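To make the layer-by-layer computation concrete, here is a minimal NumPy sketch of a forward pass; the layer sizes, random parameters, and the use of ReLU throughout are illustrative assumptions, not details from these notes.

```python
import numpy as np

def relu(z):
    # ReLU activation: elementwise max(0, z)
    return np.maximum(0, z)

def forward(x, layers):
    """Propagate input x through a list of (W, b, f) layers.

    No learning happens here: each W and b is used exactly as given.
    """
    a = x
    for W, b, f in layers:
        z = W @ a + b  # affine transform with the current weights and bias
        a = f(z)       # non-linear activation
    return a

# Illustrative, randomly initialized parameters (3 -> 4 -> 2 units)
rng = np.random.default_rng(0)
layers = [
    (rng.standard_normal((4, 3)), np.zeros(4), relu),
    (rng.standard_normal((2, 4)), np.zeros(2), relu),
]
x = np.array([0.5, -1.0, 2.0])
print(forward(x, layers))  # prediction from the current (untrained) parameters
```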
Breakdown
Key Components:
- Input data: Initial values fed into the network.
- Weights (W) and biases (b): Parameters adjusted during training.
- Activation function: Non-linear transformation, e.g., ReLU or sigmoid (see the sketch after this list).
- Output: Prediction made by the network.
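As a quick illustration of the two activation functions named above, here is a small NumPy sketch; the sample inputs are arbitrary.

```python
import numpy as np

def relu(z):
    # ReLU keeps positive values and zeroes out negatives
    return np.maximum(0, z)

def sigmoid(z):
    # Sigmoid squashes any real input into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-2.0, 0.0, 2.0])
print(relu(z))     # [0. 0. 2.]
print(sigmoid(z))  # approximately [0.119 0.5 0.881]
```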
Important
- Forward propagation calculates predictions by applying current model parameters to inputs.
- It is the first step before backpropagation, where the error is used to adjust the weights (see the sketch below).
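The relationship between the two steps can be sketched with a deliberately tiny model; the one-parameter network, squared-error loss, and learning rate below are illustrative assumptions:

```python
# Tiny one-parameter "network": prediction = w * x + b
w, b = 0.0, 0.0
x, target = 2.0, 10.0
lr = 0.05

for step in range(3):
    # 1) Forward propagation: compute the prediction with the current parameters
    y_hat = w * x + b
    loss = (y_hat - target) ** 2
    # 2) Backpropagation (a separate step): gradients of the loss
    grad_w = 2 * (y_hat - target) * x
    grad_b = 2 * (y_hat - target)
    # 3) Update step: only here do the parameters actually change
    w -= lr * grad_w
    b -= lr * grad_b
    print(step, round(y_hat, 3), round(loss, 3))
```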
Attention
- Forward propagation does not involve learning or updating weights.
- The accuracy of the resulting predictions depends entirely on the current values of the weights and biases.
Example
In a simple neural network with one hidden layer, forward propagation can be described as:

h = f(W1 x + b1)
ŷ = g(W2 h + b2)

Here, x is the input, ŷ is the output prediction, h is the hidden activation, W1 and W2 are the weight matrices, b1 and b2 are the biases, and f and g are the activation functions.
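A minimal NumPy sketch of this one-hidden-layer example; the layer sizes and the choice of ReLU for f and sigmoid for g are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(42)

# Illustrative shapes: 3 inputs -> 5 hidden units -> 1 output
W1, b1 = rng.standard_normal((5, 3)), np.zeros(5)
W2, b2 = rng.standard_normal((1, 5)), np.zeros(1)

x = np.array([1.0, 0.5, -0.3])

h = np.maximum(0, W1 @ x + b1)            # h = f(W1 x + b1), with f = ReLU
y_hat = 1 / (1 + np.exp(-(W2 @ h + b2)))  # ŷ = g(W2 h + b2), with g = sigmoid
print(y_hat)  # prediction from the current (untrained) parameters
```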
Follow
- How does the choice of activation function impact the forward propagation process?
- In deep networks, how can vanishing or exploding values during forward propagation relate to the vanishing and exploding gradient problems that affect training?
Related
- Backpropagation in neural networks
- Activation functions in deep learning