Formula

Log-Probability of a Ranked Sequence

The log-probability of an ordered list or ranked sequence of preferences, Y˚\mathring{Y}, given an input x\mathbf{x}, can be defined as the sum of the conditional log-probabilities at each stage of selection. For an ordered list Y˚\mathring{Y} given by yj1yj2yjN\mathbf{y}_{j_1} \succ \mathbf{y}_{j_2} \succ \cdots \succ \mathbf{y}_{j_N}, the overall log-probability is expressed as: logPr(Y˚x)=logPr(yj1yj2yjNx)\log \Pr(\mathring{Y} | \mathbf{x}) = \log \Pr(\mathbf{y}_{j_1} \succ \mathbf{y}_{j_2} \succ \cdots \succ \mathbf{y}_{j_N} | \mathbf{x}) =logPr(yj1x,{yj1,yj2,...,yjN})+logPr(yj2x,{yj2,...,yjN})++logPr(yjNx,{yjN})= \log \Pr(\mathbf{y}_{j_1} | \mathbf{x}, \{\mathbf{y}_{j_1},\mathbf{y}_{j_2},...,\mathbf{y}_{j_N}\}) + \log \Pr(\mathbf{y}_{j_2} | \mathbf{x}, \{\mathbf{y}_{j_2},...,\mathbf{y}_{j_N}\}) + \cdots + \log \Pr(\mathbf{y}_{j_N} | \mathbf{x}, \{\mathbf{y}_{j_N}\}) =k=1NlogPr(yjkx,Y˚k)= \sum_{k = 1}^{N} \log \Pr(\mathbf{y}_{j_k} | \mathbf{x}, \mathring{Y}_{\ge k}) where Y˚k\mathring{Y}_{\ge k} represents the subset of the list of outputs that remain unselected at the kk-th stage, i.e., Y˚k={yjk,...,yjN}\mathring{Y}_{\ge k} = \{\mathbf{y}_{j_k},...,\mathbf{y}_{j_N}\}.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related