Learn Before
State in the Context of LLMs
In language modeling, a state at a specific time step $t$, denoted as $s_t$, is defined as the sequence of tokens observed up to that point. This sequence serves as the context the model uses to predict the next token. For instance, when predicting the next token at time step $t$, the state can be mathematically defined as $s_t = \{x, y_1, \dots, y_{t-1}\}$, where $x$ represents the initial input and $y_1, \dots, y_{t-1}$ represent the tokens generated so far.
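As a minimal sketch (not from the course material), the Python snippet below represents the state as a plain list of tokens that starts as the input $x$ and grows by one generated token per decoding step. The helpers `make_toy_lm` and `generate` are hypothetical, and the canned continuation stands in for a real model's next-token prediction.

```python
def make_toy_lm(continuation):
    # Hypothetical stand-in for a real language model's next-token prediction:
    # it emits a canned continuation one token per call, ignoring the state.
    tokens = iter(continuation)
    return lambda state: next(tokens, None)


def generate(prompt_tokens, lm_step, max_new_tokens=8):
    # The state s_t is simply the initial input x plus every token generated so far.
    state = list(prompt_tokens)          # s_1 = {x}
    for _ in range(max_new_tokens):
        y_t = lm_step(state)             # predict y_t conditioned on the full state s_t
        if y_t is None:                  # toy model ran out of tokens (stand-in for end-of-sequence)
            break
        state.append(y_t)                # s_{t+1} = {x, y_1, ..., y_t}
    return state


toy_lm = make_toy_lm(["the", "sky", "is", "blue"])
final_state = generate(["The", "sun", "is", "shining", "and"], toy_lm)
print(final_state)
# ['The', 'sun', 'is', 'shining', 'and', 'the', 'sky', 'is', 'blue']
```

Note how the state is never reset: each newly generated token is appended to the existing context, so every prediction conditions on the initial input together with all tokens produced so far.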
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
State in the Context of LLMs
An autonomous agent is designed to navigate a maze to find a piece of cheese. At any given moment, the agent knows its current coordinates (e.g., row 3, column 5), whether the adjacent squares contain walls or open paths, and the location of the cheese. Based on this information, the agent must decide whether to move up, down, left, or right. Which of the following best describes the agent's 'state' in this scenario?
Defining the State for a Chess-Playing Agent
Designing a State Representation for a Self-Driving Car
Sum of Future Rewards Notation
Learn After
Example of State Definition for Next-Token Prediction
A language model is given the initial text 'The sun is shining and the sky is'. The model then generates the word 'blue'. At this point, before it attempts to generate the next word, what sequence of tokens represents the model's current 'state' that it will use as context?
The Role of State in Language Models
When a language model generates a new token, the 'state' it uses for the next prediction is updated to include only the token it just produced, discarding all previously seen tokens.