Bridging Language Modeling and Reinforcement Learning Notations in RLHF
In Reinforcement Learning from Human Feedback (RLHF), concepts are often explained in standard reinforcement learning notation to simplify the presentation, even though the underlying system is a language model. Because this departs from typical language modeling notation, understanding how RL principles apply to LLMs requires an explicit correspondence between the two framings: the LLM is the policy, the prompt together with the tokens generated so far is the state, emitting the next token is an action, and the reward model assigns a scalar reward once the full response has been generated (see the sketch below).
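To make that correspondence concrete, here is a minimal, hypothetical sketch of one RLHF "episode" written in RL vocabulary. It is not any particular library's implementation: `policy_lm` and `reward_model` are toy stand-ins I am assuming for the actual LLM and the learned reward model, and tokens are plain integers over a ten-symbol vocabulary.

```python
# A minimal sketch (illustrative, not a real RLHF implementation) of how
# RL notation maps onto language modeling. `policy_lm`, `reward_model`,
# and the toy vocabulary are assumptions made for this example.

import random

def policy_lm(state):
    """Policy pi(a | s): the LLM's next-token distribution, where the
    state s is the prompt plus the tokens generated so far.
    Faked here as a uniform draw over a tiny vocabulary."""
    vocab = list(range(10))          # toy vocabulary; 0 acts as end-of-sequence
    return random.choice(vocab)      # action a_t: the sampled next token

def reward_model(state):
    """Reward r(s): a scalar score for the finished sequence, standing in
    for the learned preference/reward model."""
    return float(len(state))         # toy score: longer responses score higher

def rollout(prompt_tokens, max_new_tokens=20):
    """One RL episode: each sampled token is an action, the transition
    deterministically appends it to the state, and the reward arrives
    only once the sequence terminates."""
    state = list(prompt_tokens)               # initial state s_0: the prompt
    for _ in range(max_new_tokens):
        action = policy_lm(state)             # a_t ~ pi(. | s_t)
        state = state + [action]              # transition: s_{t+1} = s_t + [a_t]
        if action == 0:                       # end-of-sequence token ends the episode
            break
    return state, reward_model(state)         # terminal reward for the full response

sequence, reward = rollout([5, 3, 7])
print(f"episode of {len(sequence)} tokens, terminal reward = {reward}")
```

Two features of the sketch carry over to real RLHF: the environment transition is trivial (it just appends the chosen token), and the reward is sparse, arriving only at the end of the sequence rather than per token.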
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Historical Development of RLHF
Policy Learning in RLHF
Justification for Using RLHF over Supervised Learning
Architectural Components of an RLHF System
Three-Stage Training Process of RLHF
Refinements and Alternatives to RLHF
Rationale for End-of-Sequence Rewards in RLHF
High-Level Process of RLHF with PPO
Limitations of Human Feedback in LLM Alignment
Computational and Stability Challenges of RLHF
Goal of RLHF
Origin and Application of RLHF
Dual Learning Tasks of RLHF: Reward and Policy Learning
Four-Stage Process of Reinforcement Learning from Human Feedback (RLHF)
RLHF Training Process with PPO
An AI development team is considering two different methods for training a conversational assistant to be more helpful and aligned with user expectations. Method 1 involves having human experts write a large dataset of ideal, high-quality responses to various prompts, and then training the AI to imitate these examples. Method 2 involves having the AI generate several responses to each prompt, and then asking human experts to simply rank these responses from best to worst. This ranking data is then used to train a separate 'preference model' that provides a reward signal to guide the AI's learning process. Which statement best analyzes the primary advantage of Method 2 over Method 1?
LLM as the Agent in RLHF
Reward Model as an Environment Proxy in RLHF
A team is using human feedback to improve a language model's ability to follow instructions safely and helpfully. Arrange the following high-level stages of this process into the correct chronological order.
RLHF Objective Function
Comparison of Objectives: Supervised Fine-Tuning vs. RLHF
Evaluating a Training Method for a High-Stakes Application
Diagnosing Instability in an RLHF + PPO Training Run
Choosing and Justifying an RLHF Objective Under Competing Product Constraints
Interpreting Conflicting RLHF Signals: Reward Model Ranking vs. PPO Updates Under KL Regularization
Root-Cause Analysis of a “Reward Hacking” Spike During RLHF with PPO
Tuning an RLHF + PPO Update When Reward Improves but Behavior Regresses
Post-Deployment Drift After RLHF: Diagnosing Reward Model and PPO/KL Interactions
Designing an RLHF Training Blueprint for a Regulated Customer-Support LLM
Learn After
A text-generation model is being optimized to produce high-quality responses. The process starts with an input prompt. The model then generates a sequence of text. This generated text is passed to a separate automated scoring system, which outputs a single numerical value representing the response's quality. The model's internal configuration is then updated based on this score to improve its future outputs. Match each abstract component of a learning system (left column) to its concrete implementation in this text-generation scenario (right column).
LLM as the Agent in RLHF
A team is improving a text-generation model. The process involves providing the model with an input prompt, to which the model generates a textual response. A human evaluator then assigns a numerical score to this response based on its quality. This score is used to adjust the model's behavior for future responses. If this entire process is described using the framework of a system learning from sequential decisions, what component of the text-generation process corresponds to the 'policy'?
Agent-Environment Interaction Loop in Reinforcement Learning
Deconstructing a Model Training Interaction