1Cademy - Parameterized Prediction Function using a BERT model

Learn Before

Uncluttered Notation for Encoder-Classifier Models

Formula

Parameterized Prediction Function using a BERT model

After the fine-tuning process, a complete BERT-based architecture for downstream tasks can be represented by the formula $\mathrm{Predict}_{\tilde{\omega}}(\mathrm{BERT}_{\tilde{\theta}}(\cdot))$ . This denotes that the model is applied to new data using the optimized, fine-tuned parameters $\tilde{\theta}$ for the BERT encoder and $\tilde{\omega}$ for the prediction network. The specific form of the downstream task dictates both the input and output formats of this model, as well as the underlying architecture of the prediction network layered on top of the BERT encoder.

0

1

Updated 2026-04-18

Contributors are:

Who are from:

References

Learn Before

Related

Learn After