Short Answer

Calculating a Missing Segment Score

A language model's output is divided into four segments. A reward model assigns scores to each segment. The scores for the first three segments are +1.2, -0.5, and +0.8. If the total reward for the entire output is calculated to be 2.0, what is the score for the fourth segment? Explain your calculation.

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science