1Cademy - Action in the Context of LLMs

Learn Before

Action in Reinforcement Learning

Definition

Action in the Context of LLMs

When applying reinforcement learning to Large Language Models, an action, denoted as $a$ , corresponds to a possible decision the agent can make. Specifically, an action represents a predicted token chosen from the model's vocabulary.

Updated 2026-05-01

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Policy Formula for LLMs in Reinforcement Learning
A language model is generating a response to the prompt 'The best way to learn a new skill is to...'. So far, it has produced the sequence 'The best way to learn a new skill is to practice'. At this exact point in the generation process, what constitutes the model's next 'action' within a reinforcement learning framework?
Comparing 'Action' in Different Reinforcement Learning Scenarios
Identifying the Action in LLM Fine-Tuning

Learn Before

Related

Learn After