Case Study

Evaluating a Penalty Term for Creative Writing

A developer is using a language model to generate creative story continuations. To encourage novelty, they've added a penalty to the decoding process that strongly penalizes any word that has already appeared in the generated output. However, the resulting text is often ungrammatical and nonsensical, using strange synonyms for common words like 'the' and 'is'. Evaluate the developer's approach. Why is this penalty strategy failing, and what specific modification would you recommend to better achieve the goal of generating creative yet coherent text?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science