Formal Representation of an Instruction-Tuned LLM
An instruction-fine-tuned Large Language Model can be formally represented as a conditional probability distribution, denoted Pr(y|c, z). In this expression, c stands for the instruction provided, z represents the user's input, and y is the corresponding generated model output or response.
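This factorization can be made concrete with a small sketch. The sketch below assumes the standard autoregressive decomposition Pr(y|c, z) = ∏ₜ Pr(yₜ|c, z, y₍<t₎); the function `toy_token_prob` is a hypothetical stand-in for a real model's next-token distribution, not an actual LLM API.

```python
import math

def toy_token_prob(context, token):
    # Hypothetical stub: a real LM would score `token` given `context`;
    # here every token simply gets probability 0.5 for illustration.
    return 0.5

def sequence_log_prob(c, z, y_tokens):
    """log Pr(y | c, z) as a sum of per-token log-probabilities,
    conditioning each output token on the instruction c, the input z,
    and all previously generated tokens."""
    context = c + " " + z
    log_p = 0.0
    for token in y_tokens:
        log_p += math.log(toy_token_prob(context, token))
        context += " " + token  # autoregressive: y_<t joins the context
    return log_p

c = "Summarize the key point of the following text."
z = "Jupiter is the fifth planet from the Sun and the largest in the Solar System."
y = ["Jupiter", "is", "the", "largest", "planet"]
print(sequence_log_prob(c, z, y))  # 5 tokens, each log(0.5)
```

In practice the same quantity would be computed from a trained model's logits; the structure of the sum, not the stub probabilities, is the point.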
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Two Levels of Generalization in Instruction-Tuned LLMs
Complexity of Generalization due to Instruction and Input Variation
A development team fine-tunes a large language model to be a helpful assistant for summarizing legal documents. They use a large dataset of legal texts and their corresponding summaries. After deployment, they observe the following:
- The model performs exceptionally well when asked to summarize new, unseen legal documents (e.g., contracts, court rulings).
- However, when users give it slightly different instructions, such as 'Explain this legal clause in simple terms,' 'Extract the key dates from this document,' or 'Translate this legal paragraph into French,' the model's performance is poor and unreliable.
Based on this scenario, which statement best analyzes the model's generalization capabilities?
Evaluating Fine-Tuning Strategies for Generalization
Performance Metric for Instruction-Tuned LLMs
Formal Representation of an Instruction-Tuned LLM
A large language model has been fine-tuned on a variety of instructional tasks. Match each of the following performance observations with the specific type of generalization challenge it represents.
Learn After
A user interacts with a language model by providing an instruction and an input. The instruction is 'Summarize the key point of the following text.' The input text is 'Jupiter is the fifth planet from the Sun and the largest in the Solar System.' The model generates the output 'Jupiter is the largest planet in our solar system.' How is the conditional probability of the model generating this specific output formally represented?
An instruction-tuned model's behavior can be formally represented as a conditional probability distribution
Pr(y|c, z). Match each variable from this representation to its corresponding description.
Analyzing Model Output Probability