Core Computational Task in Autoregressive Generation
The fundamental computational task of an autoregressive language model is to compute the conditional probability of the next token given the input and the tokens generated so far, Pr(y_i|x, y_{<i}). Because this computation is repeated once for every generated token, a key requirement for practical implementation is that it be efficient.
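As a minimal sketch of this step, assuming a Hugging Face causal LM (the model name "gpt2" and the helper next_token_logprobs are illustrative choices, not from the source):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative model choice; any causal (autoregressive) LM works the same way.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def next_token_logprobs(context: str) -> torch.Tensor:
    """log Pr(y_i | x, y_{<i}) over the whole vocabulary, given the context."""
    inputs = tokenizer(context, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs)
    # The logits at the last position define the next-token distribution.
    return torch.log_softmax(out.logits[0, -1], dim=-1)

logprobs = next_token_logprobs("Once upon a time,")
top = torch.topk(logprobs, k=5)
for lp, idx in zip(top.values, top.indices):
    print(f"{tokenizer.decode(int(idx))!r}: log-prob {lp.item():.3f}")
```

This step runs once per generated token, which is why efficiency matters: practical implementations cache the attention keys and values of the prefix so that each step processes only the newest token rather than re-encoding the whole context.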
Tags: Ch.5 Inference - Foundations of Large Language Models; Foundations of Large Language Models; Foundations of Large Language Models Course; Computing Sciences
Related
- Sequence Evaluation using Log-Probability: An engineer is using a generative language model to decide which of two candidate sentences is the more likely completion of the prompt 'Once upon a time,'. The model can compute various log-probability scores. Which score should the engineer compare for each candidate sentence to select the better completion? (A sketch of this comparison follows this list.)
- Debugging a Language Model's Output Score
- Rationale for Log-Probability Calculation in Generative Models
- Step-by-Step Sequence Log-Probability Computation
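The first related question comes down to comparing total sequence log-probabilities: the sum of per-token log-probabilities of each candidate completion, conditioned on the prompt. A minimal sketch, again assuming a Hugging Face causal LM (model name, helper, and example candidates are illustrative):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def completion_logprob(prompt: str, completion: str) -> float:
    """Sum of log Pr(y_i | x, y_{<i}) over the completion's tokens."""
    prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
    full_ids = tokenizer(prompt + completion, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits
    logprobs = torch.log_softmax(logits, dim=-1)
    total = 0.0
    # Logits at position t-1 predict the token at position t, so score each
    # completion token against the distribution from the preceding position.
    for t in range(prompt_ids.shape[1], full_ids.shape[1]):
        total += logprobs[0, t - 1, full_ids[0, t]].item()
    return total

prompt = "Once upon a time,"
for cand in [" there was a princess.", " colorless green ideas sleep."]:
    print(f"{cand!r}: {completion_logprob(prompt, cand):.2f}")
```

Because raw sums penalize longer candidates, a common variant divides by the number of completion tokens (mean log-probability per token) when the candidates differ in length.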
Learn After
- An autoregressive language model generating a response has so far produced the token sequence ['The', 'quick', 'brown']. What is the primary probability distribution the model must compute to determine the very next token? (See the factorization after this list.)
- Evaluating Language Model Generation Strategies
- Computational Constraints in Autoregressive Generation: To generate a sequence of text, the fundamental computational step for an autoregressive model is to compute the conditional distribution of the single next token, given the prompt and all tokens generated so far; the joint probability of the full sequence arises only as the product of these per-step conditionals and is never computed over all future tokens at once.
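Both items above come down to the chain-rule factorization that autoregressive models implement; stated in the notation used earlier:

```latex
% Chain-rule factorization behind autoregressive generation:
% the joint probability is a product of per-step next-token conditionals.
\Pr(y \mid x) = \prod_{i=1}^{n} \Pr(y_i \mid x, y_{<i}),
\qquad
\log \Pr(y \mid x) = \sum_{i=1}^{n} \log \Pr(y_i \mid x, y_{<i})
```

So after producing ['The', 'quick', 'brown'], the single distribution the model must compute is Pr(y_4 | 'The', 'quick', 'brown') over the entire vocabulary; the joint distribution over all future tokens is never materialized.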