Multiple Choice

A researcher aims to guide a language model to generate text with a consistently positive sentiment, penalizing it the moment its internal thought process begins to drift towards negativity, even before negative words are explicitly written. Which approach to designing a penalty function is most suitable for this real-time, internal-state intervention?

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science