1Cademy - A research team has a language model that was pre-trained exclusively on text segments with a maximum length of 2,048 tokens. The teams goal is to adapt this model to accurately summarize legal documents that are frequently 5,000 tokens long, a task at which the model currently performs poorly. Given this specific goal, which of the following fine-tuning strategies is most likely to be effective?

Learn Before

Fine-tuning on Longer Sequences for Enhanced Length Extrapolation

Multiple Choice

A research team has a language model that was pre-trained exclusively on text segments with a maximum length of 2,048 tokens. The team's goal is to adapt this model to accurately summarize legal documents that are frequently 5,000 tokens long, a task at which the model currently performs poorly. Given this specific goal, which of the following fine-tuning strategies is most likely to be effective?

Updated 2025-09-29

Contributors are:

Who are from:

Learn Before

Related