Learn Before
A research team is pre-training a new large language model using a massive text corpus and significant computational resources. They are debating whether to include an objective where the model must predict if two text segments are consecutive in the original source. Based on key findings from large-scale model pre-training experiments, what is the most well-supported conclusion the team should reach regarding this objective?
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Optimizing Pre-training Objectives
Large-scale pre-training experiments (most notably the RoBERTa study) found that a next-sentence-prediction objective is not essential: removing it left downstream task performance unchanged or slightly improved, provided the model was trained with masked language modeling over long contiguous spans of text. The best-supported conclusion is that the team can safely omit the sentence-ordering objective, since inter-sentence coherence is learned adequately from the language modeling objective alone at this scale of data and compute.
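For concreteness, the objective under debate can be sketched as a data-construction step. The following is a minimal illustrative sketch (not taken from any particular library): half of the training pairs are genuinely consecutive segments (label 1), and half pair a segment with a randomly drawn one (label 0). The function name and interface are hypothetical.

```python
import random

def make_nsp_examples(sentences, seed=0):
    """Build (segment_a, segment_b, is_next) pairs for a
    next-sentence-prediction objective.

    Roughly half the pairs are truly consecutive (label 1); the rest
    pair a segment with a randomly chosen one (label 0). A production
    pipeline would also exclude accidental consecutive pairs from the
    negatives; this sketch omits that for brevity.
    """
    rng = random.Random(seed)  # seeded for reproducibility
    examples = []
    for i in range(len(sentences) - 1):
        if rng.random() < 0.5:
            # Positive pair: the true next segment.
            examples.append((sentences[i], sentences[i + 1], 1))
        else:
            # Negative pair: a randomly sampled segment.
            j = rng.randrange(len(sentences))
            examples.append((sentences[i], sentences[j], 0))
    return examples
```

A classifier head would then be trained on these labels alongside the main language modeling loss; the cited finding is that this extra head adds little to nothing once the language modeling objective sees long contiguous text.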