Learn Before
Efficiency of Prefix Fine-Tuning
Prefix fine-tuning is highly computationally efficient because it trains only a small fraction of the model's parameters. Specifically, it introduces L × n × d additional parameters, where L is the number of Transformer layers, n is the number of prefixes, and d is the dimensionality of each prefix. Since this added parameter count is far smaller than the total number of parameters in the large language model (whose original parameters remain frozen), gradients and optimizer states need to be computed and stored only for the prefix vectors, so the fine-tuning process requires much less computational overhead.
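To make the parameter count concrete, here is a minimal PyTorch-style sketch of one prefix-tuned layer: the base layer's weights are frozen, and only an n × d prefix matrix is trained per layer. The class name PrefixTunedLayer and the toy dimensions (L = 24, n = 20, d = 1024) are illustrative assumptions, not taken from the course material.

```python
import torch
import torch.nn as nn

class PrefixTunedLayer(nn.Module):
    """Wraps a frozen Transformer layer and prepends n trainable prefix
    vectors to its input hidden states (hypothetical sketch)."""

    def __init__(self, base_layer: nn.Module, n_prefix: int, d_model: int):
        super().__init__()
        self.base_layer = base_layer
        for p in self.base_layer.parameters():
            p.requires_grad = False  # original LLM parameters stay fixed
        # The only new trainable parameters: an n x d matrix for this layer.
        self.prefix = nn.Parameter(torch.randn(n_prefix, d_model) * 0.02)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, seq_len, d_model)
        batch = hidden_states.size(0)
        prefix = self.prefix.unsqueeze(0).expand(batch, -1, -1)
        # Prepend the prefix along the sequence dimension.
        return self.base_layer(torch.cat([prefix, hidden_states], dim=1))

# Toy numbers: with L = 24 layers, n = 20 prefixes, and d = 1024,
# prefix tuning adds L * n * d = 491,520 trainable parameters in total,
# a tiny fraction of a model with hundreds of millions of weights.
base = nn.TransformerEncoderLayer(d_model=1024, nhead=8, batch_first=True)
layer = PrefixTunedLayer(base, n_prefix=20, d_model=1024)
x = torch.randn(2, 16, 1024)   # batch of 2, sequence length 16
out = layer(x)                 # shape (2, 36, 1024): 20 prefix + 16 input
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)               # 20 * 1024 = 20,480 per layer
```

Because the optimizer only ever sees the prefix matrices, the memory for gradients and optimizer states scales with L × n × d rather than with the full model size.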
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Input Representation in a Transformer Layer
Comparison of Prompt Tuning and Prefix Fine-Tuning
Input Composition in a Prefix-Tuned Transformer Layer
A research team is adapting a pre-trained language model for a specialized legal document summarization task. To conserve computational resources, they decide against retraining the entire model. Instead, for each layer of the model's architecture, they introduce a small set of new, trainable vectors. These vectors are prepended to the sequence of hidden states that serve as input for that layer. During training, only these newly introduced vectors are updated, while the original model parameters are kept frozen. Which statement accurately analyzes the team's approach?
Evaluating a Parameter-Efficient Tuning Method
Efficiency of Prefix Fine-Tuning
Architectural Preservation by Separating Soft Prompts from LLMs
A development team is adapting a large language model for a new task using a method where they freeze all original model weights. For each layer in the model, they prepend a small, unique sequence of trainable vectors to that layer's input. Based on this description, which statement best evaluates the primary trade-off of this technique?
Your team is building a multi-tenant LLM service w...
You’re reviewing an internal design doc for adapti...
You’re implementing a PEFT approach for a customer...
You’re reviewing a teammate’s claim about a new PE...
Diagnosing a PEFT Implementation Bug: Prompt Tuning vs Prefix Fine-Tuning
Choosing and Explaining a PEFT Strategy Under Deployment Constraints
Selecting Prompt Tuning vs Prefix Fine-Tuning by Reasoning from Where Soft Prompts Enter the Transformer
Post-Deployment PEFT Choice and Prefix Input Composition for a Multi-Tenant LLM Service
Choosing Between Prompt Tuning and Prefix Fine-Tuning for a Latency-Critical, Multi-Task LLM Service
Root-Causing a Prefix-Tuning Rollout Regression in a Multi-Task LLM Platform
Learn After
Evaluating a Model Adaptation Strategy for a Startup
A research team is adapting a large language model for a specialized task but has a very limited budget for computational resources. They decide to use a method where only a small set of newly introduced, task-specific vectors are trained, while the millions of original model parameters remain unchanged. Which statement best analyzes why this approach is computationally efficient?
A key reason that fine-tuning a model by training only a small set of new vectors prepended to each layer is computationally efficient is that this method inherently requires a much smaller training dataset than methods that update the entire model.