Learn Before
Essay

Evaluating Core Difficulties in Model Behavior Guidance

Imagine you are tasked with creating a set of rules for a powerful AI assistant to ensure it is always 'helpful and harmless'. Critically evaluate why simply creating a comprehensive list of rules and training the AI on examples of good behavior is an insufficient strategy. In your answer, analyze at least two distinct underlying problems that make this task fundamentally difficult, connecting the nature of human expectations to the practical limitations of training such a system.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.4 Alignment - Foundations of Large Language Models

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science