Impact of Pre-training Data on Instruction Following
A research lab pre-trains two large language models, Model X and Model Y, on different datasets of the same size. After pre-training is complete, and with no further training of any kind, both models are given the same novel prompt: 'Translate the following English sentence into French: The cat is sleeping.' Analyze this case study and determine which model is more likely to succeed, explaining your reasoning.
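To make the comparison concrete, here is a minimal sketch of how such a zero-shot probe could be run with the Hugging Face transformers library. The checkpoint names model_x and model_y are hypothetical stand-ins for the lab's two models; any public causal LM checkpoint (e.g. gpt2) could be substituted to run the script end to end.

```python
# Minimal zero-shot probe: feed the same instruction prompt to two base
# (pre-training only) checkpoints and compare their continuations.
# "model_x" / "model_y" are placeholder names for the hypothetical
# checkpoints in the case study; substitute a real model id such as "gpt2".
from transformers import pipeline

PROMPT = (
    "Translate the following English sentence into French: "
    "The cat is sleeping.\n"
)

for name in ("model_x", "model_y"):  # hypothetical checkpoint names
    generator = pipeline("text-generation", model=name)
    output = generator(
        PROMPT,
        max_new_tokens=30,  # a short continuation is enough to judge the attempt
        do_sample=False,    # greedy decoding keeps the comparison deterministic
    )
    # The pipeline returns the prompt plus the continuation; strip the prompt
    # so only each model's own output is printed.
    print(f"{name}: {output[0]['generated_text'][len(PROMPT):]!r}")
```

Under the hypothesis named in the title, the model whose pre-training data contained instruction-like patterns (translation pairs, Q&A, task demonstrations) would be the one more likely to produce something like 'Le chat dort.', while the other would tend to continue the prompt as generic text.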
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A researcher pre-trains a new language model on a vast and diverse dataset of text from the internet. Without any subsequent specialized training, the researcher tests the model with the prompt: 'Summarize the following paragraph in one sentence: [paragraph text]'. The model successfully produces a coherent, one-sentence summary. Which of the following statements provides the most accurate explanation for this capability?
Zero-Shot Generalization from Pre-trained Instruction Knowledge
A large language model's capacity to understand and execute a wide range of tasks based on textual prompts is primarily instilled through a specialized training stage that is separate from and follows its initial, general-purpose language learning phase.
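The statement above can also be examined empirically by contrasting a base (pre-training only) checkpoint with an instruction-tuned sibling on the summarization prompt from the first related item. Below is a minimal sketch of that comparison; the specific base/instruct checkpoint pair and the example paragraph are illustrative assumptions, not part of the original case study.

```python
# Probe whether a base (pre-training only) model already follows the
# instruction, or whether the instruction-tuned sibling is needed.
# The checkpoint pair below is an assumed example of a base model and an
# instruction-tuned derivative; any comparable pair would serve.
from transformers import pipeline

PROMPT = (
    "Summarize the following paragraph in one sentence: "
    # Example paragraph invented for illustration:
    "The cat slept all afternoon on the warm windowsill while rain "
    "fell steadily outside and the house stayed quiet.\n"
)

CHECKPOINTS = {
    "base (pre-training only)": "EleutherAI/pythia-2.8b",  # assumed example
    "instruction-tuned": "databricks/dolly-v2-3b",         # assumed example
}

for label, name in CHECKPOINTS.items():
    generator = pipeline("text-generation", model=name)
    output = generator(PROMPT, max_new_tokens=40, do_sample=False)
    # Strip the prompt so only the continuation is shown for each model.
    print(f"{label}: {output[0]['generated_text'][len(PROMPT):]!r}")
```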