Formula

Feed-Forward Network (FFN) Without Bias Terms Formula

A Feed-Forward Network (FFN) without bias terms modifies the standard FFN structure by omitting the bias parameters in its affine transformations. For an input vector h\mathbf{h}, the computation can be mathematically expressed as:

FFN(h)=σ(hWh)Wf\mathrm{FFN}(\mathbf{h}) = \sigma(\mathbf{h} \mathbf{W}_h) \mathbf{W}_f

where Wh\mathbf{W}_h and Wf\mathbf{W}_f represent the weight matrices, and σ\sigma denotes the non-linear activation function.

0

1

Updated 2026-04-21

Contributors are:

Who are from:

Tags

Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences