1Cademy - Performance Metric for Instruction-Tuned LLMs

Learn Before

Generalization Challenges in Instruction Fine-Tuning

Formula

Performance Metric for Instruction-Tuned LLMs

The effectiveness of an instruction-fine-tuned model, formally represented as $\Pr(\mathbf{y}|\mathbf{c},\mathbf{z})$, is assessed using a performance metric. This evaluation can be written as $\mathrm{Performance}(\Pr(\mathbf{y}|\mathbf{c},\mathbf{z}))$ or, in a more concise form, as $\mathrm{P}(\mathbf{c},\mathbf{z},\mathbf{y})$, where $\mathbf{c}$, $\mathbf{z}$, and $\mathbf{y}$ represent the instruction, input, and output, respectively.
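To make the notation concrete, here is a minimal Python sketch of one way $\mathrm{P}(\mathbf{c},\mathbf{z},\mathbf{y})$ could be computed. The source does not fix a particular metric, so everything here is an assumption made for illustration: the token-level F1 score, the function names `token_f1` and `performance`, and the stand-in `toy_model`. The sketch also scores a single decoded output rather than the full distribution $\Pr(\mathbf{y}|\mathbf{c},\mathbf{z})$.

```python
from collections import Counter
from typing import Callable


def token_f1(prediction: str, reference: str) -> float:
    """Token-level F1 overlap between a predicted and a reference output.

    Assumed scoring function; any other metric could be swapped in.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a perfect match, otherwise no overlap.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def performance(
    model: Callable[[str, str], str],
    c: str,
    z: str,
    y: str,
    score: Callable[[str, str], float] = token_f1,
) -> float:
    """P(c, z, y): score the model's output for (c, z) against the output y."""
    return score(model(c, z), y)


def toy_model(c: str, z: str) -> str:
    """Hypothetical stand-in for an instruction-tuned model.

    It ignores the instruction and returns the first sentence of the input.
    """
    return z.split(". ")[0].rstrip(".") + "."


if __name__ == "__main__":
    c = "Summarize the following text into a single sentence."
    z = "The sun is a star. It is hot."
    y = "The sun is a star."
    print(performance(toy_model, c, z, y))
```

Passing the scoring function as a parameter keeps the metric separate from the model, which mirrors how the formula treats $\mathrm{Performance}(\cdot)$ as a function applied to the model.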

Updated 2026-05-01

References

Reference of Foundations of Large Language Models Course

Learn After

Critique of a Unified Performance Metric for AI
A researcher is evaluating an instruction-tuned language model using a performance metric denoted as P(c, z, y). The model is given the following:
- Instruction (c): "Summarize the following text into a single sentence."
- Input (z): "The sun is a star at the center of the Solar System. It is a nearly perfect sphere of hot plasma, with internal convective motion that generates a magnetic field via a dynamo process. It is by far the most important source of energy for life on Earth."
Analyzing Model Performance Components
