Multiple Choice

A small research lab with limited computational resources and a fixed grant timeline plans to align its new language model. Their strategy involves an iterative fine-tuning process where the language model is repeatedly updated based on guidance from a complex, separately trained reward model. Which of the following represents the most significant risk this lab faces with their chosen strategy?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science