Multiple Choice

A language model is tasked with generating a five-token sequence (x0,x1,x2,x3,x4x_0, x_1, x_2, x_3, x_4) in the specific permuted order: x3β†’x1β†’x4β†’x0β†’x2x_3 \rightarrow x_1 \rightarrow x_4 \rightarrow x_0 \rightarrow x_2. At each step, the model predicts the next token in the permutation using the embeddings (e.g., ei\mathbf{e}_i for token xix_i) of all previously generated tokens as context. Which of the following correctly represents the conditional probability for the third step of this generation process?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related