Concept

Simplified Notation for Preference Probability Models

In preference modeling, the probability notation Pr(·) is often shorthand. The complete form is Pr^ϕ(·), where the superscript ϕ denotes the parameters of the underlying model (e.g., the reward model). The superscript is frequently omitted to reduce notational clutter, but the probability still depends on ϕ.
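
As an illustration, assume a Bradley-Terry style preference model, which is one common choice and is not specified by this entry. The symbols here are assumptions made for the sketch: x is a prompt, y_a and y_b are two candidate responses, r_ϕ is a hypothetical reward model with parameters ϕ, and σ is the logistic sigmoid. Under these assumptions, the full and abbreviated notations refer to the same quantity:

```latex
\[
\Pr\nolimits^{\phi}(y_a \succ y_b \mid x)
  = \sigma\bigl(r_{\phi}(x, y_a) - r_{\phi}(x, y_b)\bigr)
\qquad \text{abbreviated as} \qquad
\Pr(y_a \succ y_b \mid x)
\]
```

The abbreviated form is unambiguous only when a single parameterized model is in play. If two models were compared (say, with parameters ϕ and ϕ'), the superscript would need to be written out.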

Updated 2025-10-06

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences