Learn Before
A machine learning team has just finished pre-training a language model using a two-part system. The first, smaller model corrupted text by replacing some words with plausible alternatives. The second, larger model was then trained to identify which words in the text were original and which were replacements. The team's ultimate goal is to use this work to build a system for classifying the sentiment of customer reviews. What is the most effective and standard next step for the team to take?
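The setup described is ELECTRA-style pre-training: a small generator corrupts text and a larger discriminator learns replaced-token detection. The standard next step is to discard the generator, keep the discriminator's encoder, attach a fresh classification head, and fine-tune on labeled sentiment data. A toy numpy sketch of that handoff, with all class and variable names hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

class Encoder:
    """Stand-in for the pre-trained discriminator's transformer body
    (toy class; a real setup would reuse the ELECTRA encoder weights)."""
    def __init__(self, dim=8):
        self.W = rng.normal(size=(dim, dim))
    def __call__(self, x):
        return np.tanh(x @ self.W)

# Pre-training pairs the encoder with a per-token binary head
# ("original vs. replaced"). For downstream sentiment classification,
# that head is discarded and a new classification head is fine-tuned.
encoder = Encoder()                       # weights learned in pre-training
token_head = rng.normal(size=(8, 1))      # used only during pre-training
sentiment_head = rng.normal(size=(8, 2))  # new head, trained on reviews

x = rng.normal(size=(1, 8))               # toy "review" representation
logits = encoder(x) @ sentiment_head      # fine-tuning forward pass
print(logits.shape)                       # (1, 2): positive vs. negative
```

The generator exists only to make pre-training hard; the transferable asset is the discriminator's encoder.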
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Impact of ALiBi Bias Scalar on Model Performance
A research team is fine-tuning a language model for a text summarization task. The model uses a positional encoding scheme where a scalar hyperparameter, β, adjusts the strength of a distance-based bias in the attention mechanism. The team experiments with different values for β and records the model's performance on a validation set using the ROUGE score (higher is better). The results are as follows:
β Value | ROUGE Score
0.01    | 0.35
0.1     | 0.42
1.0     | 0.38
10.0    | 0.29

Based on this data, what is the most reasonable conclusion?
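A minimal sketch of how an ALiBi-style distance bias can be computed, assuming per-head slopes follow a geometric progression and β scales the overall bias strength; function names here are illustrative:

```python
import numpy as np

def alibi_slopes(n_heads):
    # Geometric progression of per-head slopes: the first head's slope
    # is 2^(-8/n_heads) and each subsequent head multiplies by the same
    # ratio, so later heads attend more uniformly over distance.
    ratio = 2.0 ** (-8.0 / n_heads)
    return np.array([ratio ** (h + 1) for h in range(n_heads)])

def alibi_bias(seq_len, slope, beta=1.0):
    # Bias added to attention logits: -beta * slope * distance for
    # causal (past) positions, -inf for future positions. Larger beta
    # penalizes distant tokens more strongly.
    pos = np.arange(seq_len)
    dist = pos[None, :] - pos[:, None]                        # j - i
    causal = np.where(dist <= 0, dist.astype(float), -np.inf)
    return beta * slope * causal
```

With 8 heads the slopes are 1/2, 1/4, ..., 1/256; the bias for a query attending 3 positions back with slope 1/2 and β = 1 is -1.5, and scaling β up makes the model increasingly local, which matches the pattern of degraded scores at large β in the table above.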
Geometric Progression for ALiBi's Scalar per Head