Learn Before
Concept

Reward Function for Reinforcement Learning Trajectories

In reinforcement learning for helicopter control, a reward function scores how good each possible trajectory is. The reward may penalize crashes heavily and reward safe landings, while trading off smoothness, landing location, ride roughness, and other desiderata.

0

1

Updated 2026-05-25

Contributors are:

Who are from:

Tags

Data Science

Foundations of Large Language Models Course

Computing Sciences

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Machine Learning Strategy