1Cademy - Rationale for Cumulative Objective in Dialogue Models

Learn Before

Training Objective for Multi-Round Dialogue Models

Short Answer

Rationale for Cumulative Objective in Dialogue Models

A simplified, but flawed, approach to training a multi-round dialogue model might be to only maximize the probability of the final response in a conversation, conditioned on the entire history. Explain why the standard training objective, which sums the log-probabilities of every model response in the conversation, is a more effective approach for developing a capable dialogue agent.

Updated 2025-10-09

Contributors are:

Who are from:

Learn Before

Related