Case Study

Impact of Pre-training Data on Instruction Following

A research lab pre-trains two large language models, Model X and Model Y, on different datasets of the same size. After pre-training is complete, and with no further training of any kind, both models are given the same novel prompt: 'Translate the following English sentence into French: The cat is sleeping.' Analyze the case study below and determine which model is more likely to succeed, explaining your reasoning.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science