Analyzing Ambiguous AI Training Objectives
A city council wants to use a language model to summarize thousands of public comments on a controversial proposal to build a new factory. The model is instructed to 'prioritize the most helpful and constructive feedback' to guide the council's decision. The comments include passionate arguments from local residents about potential noise and pollution, detailed economic forecasts from business leaders about job creation, and urgent warnings from environmental scientists about ecosystem damage.
Analyze why the instruction to prioritize 'helpful and constructive feedback' is a challenging objective. Identify and explain at least two distinct difficulties that stem from the ambiguity of human preferences in this context.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science