1Cademy - A system assigns a worth value to potential text completions, calculated as the exponential of a reward score. Initially, three completions (A, B, C) have reward scores of 2.0, 3.0, and 4.0, respectively. If the reward score for *each* completion is increased by a constant value of 1.0, how does this change affect the ratio of worth between any two completions (e.g., the ratio of worth(B) to worth(A))?

Learn Before

Worth Function in Plackett-Luce Model

Multiple Choice

A system assigns a 'worth' value to potential text completions, calculated as the exponential of a reward score. Initially, three completions (A, B, C) have reward scores of 2.0, 3.0, and 4.0, respectively. If the reward score for each completion is increased by a constant value of 1.0, how does this change affect the ratio of worth between any two completions (e.g., the ratio of worth(B) to worth(A))?
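A quick numerical check can make the question concrete. The sketch below assumes worth is exactly exp(reward), as stated in the question; the names `worth`, `rewards`, `shift`, and `top_choice_probs` are illustrative and not taken from any particular library. Since exp(r + c) = exp(c) * exp(r), adding the same constant to every reward multiplies every worth by the same factor exp(c), so that factor cancels in any ratio.

```python
import math

def worth(reward):
    # Worth as defined in the question: the exponential of the reward score.
    return math.exp(reward)

def top_choice_probs(rewards):
    # Illustrative helper: each completion's worth divided by the total worth.
    total = sum(worth(r) for r in rewards.values())
    return {name: worth(r) / total for name, r in rewards.items()}

rewards = {"A": 2.0, "B": 3.0, "C": 4.0}
shift = 1.0  # constant added to every reward score
shifted = {name: r + shift for name, r in rewards.items()}

ratio_before = worth(rewards["B"]) / worth(rewards["A"])  # exp(3 - 2)
ratio_after = worth(shifted["B"]) / worth(shifted["A"])   # exp(4 - 3)

print(ratio_before, ratio_after)
print(top_choice_probs(rewards))
print(top_choice_probs(shifted))
```

Both ratios come out equal to exp(1.0), up to floating-point rounding. Each individual worth grows by a factor of exp(1.0), but the ratio depends only on the difference between the two reward scores, and that difference is unchanged by the shift. The normalized shares computed by the illustrative helper are unchanged for the same reason.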

Updated 2025-09-28
