Short Answer

Mechanism of Decoding Penalties for Reasoning

An AI engineer observes that a language model consistently provides correct but overly brief answers to complex problems, failing to show its work. The engineer decides to apply a penalty during the decoding process to discourage short responses. Explain the underlying mechanism by which this penalty influences the model's token selection process to produce a more detailed, step-by-step reasoning path.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science