A startup is building an LLM to automatically grade high school history essays. To ensure scalability and rapid deployment, they plan to align the model exclusively using AI-generated feedback. The AI feedback system will be trained to check for factual accuracy against a knowledge base, grammatical correctness, and essay length. What is the most significant risk of this alignment strategy?
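The core risk here is Goodhart's law: a proxy reward built only from checkable surface features (fact matches, grammar, length) can be gamed by essays with no actual analysis. A minimal sketch, with a hypothetical `proxy_reward` function and toy knowledge base invented for illustration:

```python
# Hypothetical toy grader mirroring the question's rubric: it scores only
# factual keyword matches, a crude grammar check, and a length bonus.
KNOWLEDGE_BASE = {"1914", "archduke", "treaty of versailles", "1918"}

def proxy_reward(essay: str) -> float:
    """Score = facts matched + capitalized sentences + capped length bonus."""
    text = essay.lower()
    fact_score = sum(1 for fact in KNOWLEDGE_BASE if fact in text)
    sentences = [s.strip() for s in essay.split(".") if s.strip()]
    grammar_score = sum(1 for s in sentences if s[0].isupper())
    length_score = min(len(essay.split()) / 50, 2.0)
    return fact_score + grammar_score + length_score

# A thoughtful essay with real historical analysis but few rubric keywords...
thoughtful = ("The war's causes were structural. Alliance obligations turned "
              "a regional crisis into a continental one.")
# ...loses to a repetitive, fact-stuffed essay with no argument at all.
stuffed = ("In 1914 the Archduke died. "
           "The Treaty of Versailles came after 1918. ") * 5

print(proxy_reward(thoughtful))  # low score despite genuine analysis
print(proxy_reward(stuffed))     # high score from keyword stuffing
```

The gamed essay outscores the thoughtful one, illustrating why an AI feedback signal restricted to these measurable proxies can reward exactly the wrong behavior in an essay grader.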
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Combining AI and Human Feedback for LLM Training
Choosing a Feedback Method for LLM Alignment
A development team is aligning a large language model to function as a creative writing partner. The primary goal is to ensure the model's suggestions are imaginative, emotionally resonant, and stylistically unique. The team decides to rely exclusively on an automated, AI-based feedback system for this alignment process. Which of the following statements best identifies a critical flaw in this strategy?