Learn Before
In a two-model pre-training setup, a small 'generator' model first processes an input sentence by masking some words and then filling those masked positions with its own predictions. The resulting sentence, which may differ from the original, is then passed to a larger 'discriminator' model. What is the most critical function of the generator's output in this process?
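For illustration, here is a minimal, framework-free Python sketch of this replaced-token-detection data flow. The `toy_generator` function and its fixed `VOCAB` are hypothetical stand-ins for a small masked language model; a real setup would sample each replacement from the generator's predicted distribution over the vocabulary.

```python
import random

# Toy sketch of ELECTRA-style replaced token detection (RTD) data preparation.
# `toy_generator` is a hypothetical stand-in for a small masked language model:
# here it simply samples a token from a fixed vocabulary.

VOCAB = ["the", "chef", "cooked", "ate", "a", "meal", "quickly"]

def toy_generator(masked_tokens):
    """Fill each [MASK] position with a sampled token (MLM stand-in)."""
    return [random.choice(VOCAB) if t == "[MASK]" else t for t in masked_tokens]

def corrupt(tokens, mask_rate=0.3, seed=0):
    """Mask ~mask_rate of positions, fill them with generator samples,
    and emit per-token labels for the discriminator."""
    random.seed(seed)
    masked = [("[MASK]" if random.random() < mask_rate else t) for t in tokens]
    filled = toy_generator(masked)
    # Label: 1 = replaced (generator's sample differs from the original),
    # 0 = original. If the generator happens to reproduce the original
    # token exactly, that position is labeled 0, as in ELECTRA.
    labels = [int(f != o) for f, o in zip(filled, tokens)]
    return filled, labels

tokens = ["the", "chef", "cooked", "the", "meal"]
corrupted, labels = corrupt(tokens)
print(corrupted)  # e.g. ['the', 'chef', 'ate', 'the', 'meal']
print(labels)     # e.g. [0, 0, 1, 0, 0]  -> discriminator training targets
```

The key point the sketch makes concrete: the generator's output is the training input for the discriminator, and the per-token labels (original vs. replaced) are the discriminator's supervision signal.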
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating Corrupted Text for Model Training
A small masked language model is used to create a corrupted version of an input text sequence for a subsequent training task. Arrange the steps this model takes to generate the final corrupted sequence in the correct chronological order.
Visual Example of Generator Operation in Replaced Token Detection