1Cademy - A researcher aims to guide a language model to generate text with a consistently positive sentiment, penalizing it the moment its internal thought process begins to drift towards negativity, even before negative words are explicitly written. Which approach to designing a penalty function is most suitable for this real-time, internal-state intervention?

Learn Before

Penalty Functions Based on Hidden States

Multiple Choice

A researcher aims to guide a language model to generate text with a consistently positive sentiment, penalizing it the moment its internal thought process begins to drift towards negativity, even before negative words are explicitly written. Which approach to designing a penalty function is most suitable for this real-time, internal-state intervention?

Updated 2025-10-06

Contributors are:

Who are from:

Learn Before

Related