Multiple Choice

A team is refining a language model that generates step-by-step solutions to complex problems. For each reasoning step, the model provides a confidence score indicating its certainty in the step's correctness. The team has a limited budget for human annotators to review and correct the model's reasoning. To maximize the model's performance improvement with this limited budget, which of the following types of reasoning steps should the team prioritize for annotation?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science