1Cademy - Evaluating Model Training Objectives

Learn Before

Definition of LLM Alignment

Short Answer

Evaluating Model Training Objectives

A company develops a chatbot for customer support. To improve its efficiency, they train it on a dataset where the 'best' responses are those that lead to the quickest conversation conclusion. After deployment, they observe that the chatbot is very fast at ending conversations, but customer satisfaction has plummeted because the bot often provides abrupt, unhelpful answers to close tickets quickly. Based on the goal of making a model's behavior consistent with human intentions, explain why this training process failed.

Updated 2025-10-03

Contributors are:

Who are from:

Learn Before

Related