Case Study

Analyzing Prediction Context in Arbitrary-Order Generation

A language model is tasked with generating a 5-token sequence, originally ordered as x_1, x_2, x_3, x_4, x_5. Instead of a standard left-to-right approach, it uses the following arbitrary generation order: x_3 -> x_5 -> x_1 -> x_4 -> x_2. At which step in this generation process is the prediction task most analogous to a masked language modeling task? Explain your reasoning by describing the context available for the prediction at that step.
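
To make the reasoning concrete, here is a minimal illustrative sketch (not part of the original question) that replays the stated generation order and prints which tokens are already available as context before each prediction:

```python
# Illustrative sketch: replay the generation order x_3 -> x_5 -> x_1 -> x_4 -> x_2
# and show which tokens are already available as context at each step.
order = [3, 5, 1, 4, 2]  # token positions, in the order they are generated

known = set()  # positions generated so far
for step, pos in enumerate(order, start=1):
    context = ", ".join(f"x_{i}" for i in sorted(known)) or "(none)"
    print(f"step {step}: predict x_{pos} given context: {context}")
    known.add(pos)
```

Running this shows that only at the final step (step 5, predicting x_2) are all four other tokens x_1, x_3, x_4, x_5 already observed, so the model predicts a single missing token from full context on both sides, the same setup as predicting a masked token in masked language modeling; every earlier step sees only a partial context.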




Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science