Concept

Using an Oracle Model to Distinguish Model vs. Search Errors

A method for diagnosing a model's performance issues involves using a more powerful model as an 'oracle' to evaluate the outputs of an older, weaker model. The oracle selects the best response from an N-best list generated by the old model. The performance difference between this 'oracle output' and the old model's top-ranked output indicates the type of error. A significant difference suggests a 'model error,' where the model is fundamentally incapable of generating the correct answer. A small difference points to a 'search error,' meaning the model produced a good answer but failed to rank it as the top choice.

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course