Model Usage After Replaced Token Detection Training
Once the training for Replaced Token Detection is finished, the two models involved have different fates. The generator, having served its purpose of creating a challenging training task, is discarded. The discriminator's encoder, which has learned rich contextual representations, is preserved and used as the pre-trained model for various downstream natural language understanding tasks.
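This hand-off can be sketched in minimal Python. All class and function names below are illustrative stand-ins (not from any specific library), assuming a small generator and a larger discriminator as described above:

```python
# Sketch of the post-pre-training hand-off in Replaced Token Detection.
# All names here are hypothetical; a real setup would use Transformer
# encoders and a fine-tuning loop.

class Encoder:
    """Stands in for a contextual encoder with a given hidden size."""
    def __init__(self, hidden_size):
        self.hidden_size = hidden_size


class Generator:
    """Small model whose only role is to corrupt text with plausible
    replacement tokens, creating the discriminator's training task."""
    def __init__(self):
        self.encoder = Encoder(hidden_size=64)


class Discriminator:
    """Larger model trained to label each token as original or replaced.
    Its encoder is what carries the learned representations."""
    def __init__(self):
        self.encoder = Encoder(hidden_size=256)


def build_downstream_classifier(discriminator, num_labels):
    """Keep only the discriminator's encoder and attach a new
    task-specific head (here just a label count) for fine-tuning,
    e.g. on sentiment classification."""
    return {"encoder": discriminator.encoder, "num_labels": num_labels}


generator = Generator()          # served its purpose during pre-training
discriminator = Discriminator()  # its encoder is the artifact we keep

classifier = build_downstream_classifier(discriminator, num_labels=2)
del generator  # the generator is discarded after pre-training
```

The key point the sketch illustrates is that the downstream model is built only from the discriminator's encoder; the generator never participates in fine-tuning.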
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
The Generator in Replaced Token Detection
The Discriminator in Replaced Token Detection
Joint Training in Replaced Token Detection
Model Usage After Replaced Token Detection Training
Consider a pre-training method for a language model that uses two components. The first component, a 'generator', takes an original sentence and replaces a few words with other plausible words. The second component, a 'discriminator', then reads this modified sentence. The discriminator's task is to examine every single word in the modified sentence and decide for each one: 'Is this word from the original sentence, or is it a replacement?' What is the primary advantage of training the discriminator on this per-word classification task compared to a task where it only has to predict the original identity of the few words that were replaced?
Analyzing a Language Model's Training Step
A language model is being pre-trained using a method where it learns to distinguish original words from plausible replacements. Arrange the following steps of a single training iteration into the correct chronological order.
Learn After
A machine learning team has just finished pre-training a language model using a two-part system. The first, smaller model corrupted text by replacing some words with plausible alternatives. The second, larger model was then trained to identify which words in the text were original and which were replacements. The team's ultimate goal is to use this work to build a system for classifying the sentiment of customer reviews. What is the most effective and standard next step for the team to take?
Fate of Models in a Two-Part Pre-training Scheme