Multiple Choice

A language model is tasked with generating a five-token sequence (x0,x1,x2,x3,x4x_0, x_1, x_2, x_3, x_4) in the specific permuted order: x_3 rightarrow x_1 rightarrow x_4 rightarrow x_0 rightarrow x_2. At each step, the model predicts the next token in the permutation using the embeddings (e.g., ei\mathbf{e}_i for token xix_i) of all previously generated tokens as context. Which of the following correctly represents the conditional probability for the third step of this generation process?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science