1Cademy - A team is developing a reward model to score segments of text generated by a language model. The standard approach calculates a segments score using the initial prompt, the complete generated output, and the specific segment being evaluated. To improve efficiency, a developer suggests modifying the process to calculate the score using only the initial prompt and the specific segment, omitting the rest of the generated output. What is the most significant analytical flaw in this modified approac

Learn Before

Segment-Based Reward Score Formula

Multiple Choice

A team is developing a reward model to score segments of text generated by a language model. The standard approach calculates a segment's score using the initial prompt, the complete generated output, and the specific segment being evaluated. To improve efficiency, a developer suggests modifying the process to calculate the score using only the initial prompt and the specific segment, omitting the rest of the generated output. What is the most significant analytical flaw in this modified approac

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related