Formula

Formula for a Negative Reward Function r(x, y, ȳ)

The function r(x,y,yˉ)=1r(\mathbf{x}, \mathbf{y}, \bar{\mathbf{y}}) = -1 assigns a constant negative value. This function's arguments are the vectors x\mathbf{x} and y\mathbf{y}, and the average vector yˉ\bar{\mathbf{y}}. In machine learning, a value of -1 often signifies a negative reward, a penalty, or indicates a negative sample.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Related