Learn Before
Concept
Early Exit in DeeBERT
Early Exit in DeeBERT is achieved as follows:
- Training stage:
- Train and fine tune a BERT on downstream tasks.
- Freeze the parameters of BERT, insert a linear classifier after each Transformer layer.
- Train the classifier by minimizing the sum of their cross-entropy loss.
- Inference stage: early exits when an internal classifier output a distribution that has entropy lower than the predefined threshold.
0
1
Updated 2022-06-25
Tags
Data Science