Multiple Choice

A development team is training a language model to generate step-by-step solutions to complex logic puzzles. The primary objective is to improve the model's ability to construct a valid and coherent reasoning path, not just to arrive at the correct final conclusion. The team plans to use human annotators to provide feedback on the model's generated solutions. Which of the following annotation strategies is most directly aligned with improving the model's reasoning process?

Updated 2025-09-26

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science