Segment Score as Difference of Sequence Scores
When automating the rating of segments using a strong language model, it can be challenging to obtain human preferences directly at the segment level. One strategy is to use the language model to assign a cumulative rating score to the sequence up to the current segment, and then subtract the score of the sequence up to the previous segment. The score for a specific segment is thus calculated as the difference between the LLM's score for the sequence ending at segment and the score for the sequence ending at segment . The relationship is formulated as: where represents the rating score assigned by the language model.

0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Segment Score as Difference of Sequence Scores
Calculating Segment Score from a Language Model
A team is using a large language model to automate the quality rating of a narrative. The model generates a cumulative quality score for the text after each segment is added. The score for an individual segment is then determined by the increase in the cumulative score that its addition causes. If the calculated score for 'Segment C' is significantly higher than for 'Segment A' and 'Segment B', what is the most precise conclusion one can draw?
Interpreting Negative Segment Scores
Learn After
Training a Reward Model on Segment-Level Scores via Regression Loss
A language model is used to generate a quality score for a piece of text. The text is composed of three distinct segments. The model provides the following cumulative scores:
- Score for the first segment: 0.7
- Score for the first two segments combined: 0.5
- Score for all three segments combined: 0.9
Based on the principle that a segment's individual score is the change in the total score its addition causes, what is the calculated score for the second segment alone?
Analyzing Chatbot Response Quality
A language model generates a cumulative quality score for a text as it is built from four sequential segments. The cumulative scores are as follows:
- Score after Segment 1: 0.6
- Score after Segment 2: 0.9
- Score after Segment 3: 0.7
- Score after Segment 4: 0.8
Based on the principle that an individual segment's score is the change in the cumulative score, arrange Segments 2, 3, and 4 in order from the one that contributed most positively to the overall quality to the one that contributed most negatively.