Short Answer

Rationale for Cumulative Objective in Dialogue Models

A simplified, but flawed, approach to training a multi-round dialogue model might be to only maximize the probability of the final response in a conversation, conditioned on the entire history. Explain why the standard training objective, which sums the log-probabilities of every model response in the conversation, is a more effective approach for developing a capable dialogue agent.

0

1

Updated 2025-10-09

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science