Learn Before

Input Formulation for Segment-Based Reward Computation

Multiple Choice

A team is developing a system to score individual sentences within a long, multi-paragraph response generated by a model. They observe that the system sometimes gives a high score to a sentence that, while well-written in isolation, directly contradicts information presented in a previous paragraph of the same response. Which of the following is the most likely reason for this evaluation error?

Updated 2025-09-28
