ReLU (Rectified Linear Unit)
The Rectified Linear Unit (ReLU) is a common choice for the activation function within the hidden layers of neural networks. It is defined to output the positive portion of its argument. When applied element-wise to an input vector x, the ReLU function is given by the formula: ReLU(x) = max(0, x).
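To make the definition concrete, here is a minimal NumPy sketch (the relu helper name and the sample vector are illustrative, not taken from the lesson): it applies max(0, x) element-wise, so negative components are zeroed while non-negative components pass through unchanged.

```python
import numpy as np

def relu(x: np.ndarray) -> np.ndarray:
    # Element-wise ReLU: each component becomes max(0, x_i).
    return np.maximum(0, x)

# Illustrative input vector: negatives are zeroed, non-negatives are unchanged.
h = np.array([-2.0, -0.5, 0.0, 1.5, 3.0])
print(relu(h))  # [0.  0.  0.  1.5 3. ]
```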

Related
Linear vs. Non-Linear Activation Functions
Sigmoid/Logistic Function
TanH/Hyperbolic Tangent Function
Swish Function
ELU (Exponential Linear Unit)
Which activation function is represented by each of these plots?
Which of the following introduces nonlinearity into neural networks?
Softmax Function
Importance of Activation Function Design in Wide FFNs
In a standard two-layer feed-forward network (FFN) within a Transformer, an input vector h has a dimension of d = 512. The network's hidden layer has a dimension of d_h = 2048. The FFN is defined by the operation: Output = σ(h * W_h + b_h) * W_f + b_f, where σ is a non-linear activation function. What must be the dimensions of the weight matrix W_f for the output vector to have the same dimension as the input vector h?
Troubleshooting FFN Dimension Mismatch
A standard Feed-Forward Network (FFN) in a Transformer model processes an input vector h of dimension d using the formula: FFN(h) = σ(h * W_h + b_h) * W_f + b_f. The intermediate hidden layer has a dimension d_h. Match each component from the formula to its correct description.
You’re debugging a Transformer block in an interna...
You are reviewing a teammate’s implementation of a...
You’re implementing a single Transformer block in ...
Design a Transformer Block Spec for a New Internal LLM Library (Shapes + Norm Placement)
Diagnosing a Transformer Block Refactor: Attention/FFN Shapes and Norm Placement
Choosing Pre-Norm vs Post-Norm for a Deep Transformer: Stability, Shapes, and Sub-layer Semantics
Root-Cause Analysis of Training Instability After a “Minor” Transformer Block Change
Production Bug Triage: Transformer Block Norm Placement vs Attention/FFN Interface Contracts
Post-Norm vs Pre-Norm Migration: Verifying Tensor Shapes and Correct Sub-layer Wiring
Incident Review: Silent Performance Regression After “Optimization” of a Transformer Block
Learn After
Pros and Cons of ReLU
Leaky ReLU
Parametric ReLU
Derivative of ReLU (Rectified Linear Unit) function
A common non-linear activation function is defined by the operation f(x) = max(0, x). If this function is applied element-wise to the input vector h = [2.7, -1.3, 0, -4.5, 8.1], what is the resulting output vector?
A neuron in a neural network computes a pre-activation value (the weighted sum of its inputs plus bias) of -2.8. The neuron then applies an activation function defined by the formula f(z) = max(0, z). Based on this, what will be the neuron's output, and what is the direct consequence for this neuron's learning process during backpropagation for this specific input?
A hidden layer in a neural network produces the following vector of pre-activation values for a single neuron across five different training examples: [-3.1, -0.5, 0.8, 2.4, 5.0]. An activation function defined as f(x) = max(0, x) is then applied to this vector. Which statement best analyzes the effect of this function on the information passed to the next layer?
Gaussian Error Linear Unit (GELU)