Formula for Generalization Across Tasks
Generalization across tasks occurs when an instruction-fine-tuned model's average performance over all new instruction-input pairs is above a predefined threshold value, $\epsilon$. This condition is mathematically expressed as:

$$\frac{1}{|\mathcal{D}_{\text{new}}|} \sum_{(c,\, x) \in \mathcal{D}_{\text{new}}} \text{Score}(\hat{y}) > \epsilon$$

where $\mathcal{D}_{\text{new}}$ is the set of new instruction-input pairs, $(c, x)$ represents a specific new instruction and input from the set, and $\hat{y}$ is the corresponding model output.
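The condition above can be sketched as a small function. This is a minimal illustration, assuming per-pair performance has already been reduced to a score in [0, 1]; the function name and signature are illustrative, not from the source.

```python
def generalizes_across_tasks(scores, epsilon):
    """Check the inter-task generalization condition.

    scores: per-pair performance scores, one for each new
            instruction-input pair in the evaluation set (assumed in [0, 1]).
    epsilon: the predefined threshold the average must exceed.
    Returns True if the average score over the new set exceeds epsilon.
    """
    average = sum(scores) / len(scores)
    return average > epsilon
```

For example, `generalizes_across_tasks([0.9, 0.8, 0.7], 0.6)` returns `True`, since the average 0.8 exceeds the threshold 0.6.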

Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
LLM Generalization Evaluation
Definition of Intra-Task Generalization
Formal Definition of Intra-Task Generalization
An AI team fine-tunes a language model exclusively on a dataset for a single task: translating English legal documents into French. The model is then evaluated on two test sets.
- Test Set A: A new, unseen collection of English legal documents to be translated into French.
- Test Set B: A collection of diverse tasks, such as writing Python code, composing poetry, and summarizing news articles.
The model performs very well on Test Set A but performs poorly on Test Set B. What does this evaluation reveal about the model's generalization abilities?
Analyzing LLM Performance
Formula for Generalization Across Tasks
Learn After
Evaluating Inter-Task Generalization
A language model's ability to generalize to new tasks is evaluated using a set of 5 new instruction-input pairs. The model's performance on each pair is scored on a scale of 0 to 1, yielding the scores: [0.9, 0.8, 0.3, 0.2, 0.7]. According to the formal condition for inter-task generalization, which is defined as the average performance over the new set exceeding a threshold (ε), does this model demonstrate this capability if the threshold is set at ε = 0.6?
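The averaging in this question can be checked directly. A minimal sketch, using the scores and threshold given above:

```python
# Scores for the 5 new instruction-input pairs, as listed in the question.
scores = [0.9, 0.8, 0.3, 0.2, 0.7]
epsilon = 0.6  # the threshold the average must exceed

# Average performance over the new set: (0.9+0.8+0.3+0.2+0.7)/5 ≈ 0.58.
average = sum(scores) / len(scores)

# The inter-task generalization condition requires average > epsilon.
print(average > epsilon)  # prints False: 0.58 does not exceed 0.6
```

Since the average of roughly 0.58 falls short of the 0.6 threshold, the model does not satisfy the formal condition despite its strong scores on three of the five pairs.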
A model's capability to perform well across a variety of different tasks is formally assessed using the condition $\frac{1}{|\mathcal{D}_{\text{new}}|} \sum_{(c,\, x) \in \mathcal{D}_{\text{new}}} \text{Score}(\hat{y}) > \epsilon$. In this expression, what is the most critical characteristic of the set of new instruction-input pairs, denoted by $\mathcal{D}_{\text{new}}$, for a valid evaluation?