1Cademy - An AI team fine-tunes a language model exclusively on a dataset for a single task: translating English legal documents into French. The model is then evaluated on two test sets. - **Test Set A:** A new, unseen collection of English legal documents to be translated into French. - **Test Set B:** A collection of diverse tasks, such as writing Python code, composing poetry, and summarizing news articles. The model performs very well on Test Set A but performs poorly on Test Set B. What does this

Learn Before

Two Levels of Generalization in Instruction-Tuned LLMs

Multiple Choice

An AI team fine-tunes a language model exclusively on a dataset for a single task: translating English legal documents into French. The model is then evaluated on two test sets.

Test Set A: A new, unseen collection of English legal documents to be translated into French.
Test Set B: A collection of diverse tasks, such as writing Python code, composing poetry, and summarizing news articles.

The model performs very well on Test Set A but performs poorly on Test Set B. What does this

Updated 2025-10-06

Contributors are:

Who are from:

Learn Before

Related