When using the formula Score(y) = π(y|x) * exp(r(x, y)) to adjust the likelihood of a potential output y, setting the reward r(x, y) to zero makes the exponential factor exp(0) = 1, so the final score for that output simply equals its base probability π(y|x); the output is not eliminated from consideration.
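A minimal sketch of this re-weighting, using made-up base probabilities and rewards for four hypothetical completions, showing that a zero reward leaves the score equal to the base probability:

```python
import math

# Hypothetical base probabilities pi(y|x) and rewards r(x, y) for four
# candidate completions; all values here are invented for illustration.
base_probs = {"y1": 0.40, "y2": 0.30, "y3": 0.20, "y4": 0.10}
rewards    = {"y1": 0.0,  "y2": 1.0,  "y3": 2.0,  "y4": -1.0}

# Score(y) = pi(y|x) * exp(r(x, y))
scores = {y: base_probs[y] * math.exp(rewards[y]) for y in base_probs}

# With r = 0, exp(0) = 1, so the score equals the base probability.
assert scores["y1"] == base_probs["y1"]

best = max(scores, key=scores.get)
print(best)  # → y3
```

Note that a large enough reward (here r = 2 for y3) can outweigh a higher base probability (y1's 0.40), which is exactly how the reward steers the distribution.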
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Re-weighting a Reference Probability Distribution with a Scaled Reward
A language model is generating a completion for an input x. The model has a base probability distribution, π(y|x), for four potential completions (y). To steer the model's output, a reward function, r(x, y), is applied to create a new unnormalized score for each completion using the formula: Score(y) = π(y|x) * exp(r(x, y)). Given the values below, which completion will have the highest score?
Steering Language Model Output for Slogan Generation