Learn Before
A startup is developing a language model to provide personalized financial advice to a global audience. To ensure the model's advice is safe and helpful, they plan to fine-tune it using preference data collected from a small team of 10 financial analysts, all from the company's headquarters in New York City. Based on the known challenges of using human-provided data for model alignment, what is the most critical potential flaw in this strategy?
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
AI Feedback as a Solution to Human Feedback Limitations
Analyzing Alignment Challenges in a Global Chatbot Project
Evaluating a Claim of Perfect Model Alignment