Comparison

Dense vs. Sparse Rewards

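Reinforcement learning feedback can be categorized by how often it arrives. Dense rewards are provided at every step or nearly every step, so the policy gets an immediate signal about each action, which generally makes training easier and more sample-efficient. Sparse rewards, in contrast, arrive rarely, often only once when the task is completed, leaving the learner to work out which earlier actions deserve the credit. Dense feedback is usually preferred when it is available, but many scenarios, particularly in NLP, are inherently sparse: a generated response, for example, is typically scored only after the full sequence has been produced.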
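The difference can be made concrete with a small sketch. The helper names below (`sparse_rewards`, `dense_rewards`, `discounted_returns`) and the example scores are hypothetical and chosen only for illustration; they are not taken from any particular library or from the course material.

```python
from typing import List


def sparse_rewards(num_steps: int, final_score: float) -> List[float]:
    """Zero reward at every step except the last, which carries the whole score."""
    if num_steps <= 0:
        return []
    return [0.0] * (num_steps - 1) + [final_score]


def dense_rewards(step_scores: List[float]) -> List[float]:
    """One reward per step; here the per-step scores are used directly."""
    return list(step_scores)


def discounted_returns(rewards: List[float], gamma: float = 1.0) -> List[float]:
    """Return-to-go G_t = r_t + gamma * G_{t+1}, computed from right to left."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


if __name__ == "__main__":
    # Sparse case: a 4-step episode (e.g. 4 generated tokens) scored once at the end.
    sparse = sparse_rewards(4, final_score=1.0)
    print(sparse, discounted_returns(sparse))

    # Dense case: an assumed per-step score for a 3-step episode.
    dense = dense_rewards([0.5, -0.5, 1.0])
    print(dense, discounted_returns(dense))
```

In the sparse case with `gamma = 1.0`, every step ends up with the same return, so the signal alone does not say which step helped or hurt. In the dense case the returns differ from step to step, which is one way to see why dense feedback tends to make credit assignment easier.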


Updated 2026-05-01


Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences