1Cademy - A research team is building a new language model for natural language understanding tasks. They have a fixed model architecture and a large computational budget. They are debating the most effective pre-training strategy. Based on the primary findings demonstrated by subsequent improvements on encoder-only models, which approach is most likely to yield the best performance?

Learn Before

RoBERTa

Multiple Choice

A research team is building a new language model for natural language understanding tasks. They have a fixed model architecture and a large computational budget. They are debating the most effective pre-training strategy. Based on the primary findings demonstrated by subsequent improvements on encoder-only models, which approach is most likely to yield the best performance?

Updated 2025-10-01

Contributors are:

Who are from:

Learn Before

Related