Learn Before
In a multi-layer transformer model adapted for prefix-based tuning, the input to any given layer L is formed by prepending a set of layer-specific trainable vectors (the 'prefix') to the sequence representation from the previous layer. After all computations within layer L are finished, what is the precise composition of the input sequence for the next layer, L+1?
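The question describes the prepend-per-layer variant of prefix tuning. Below is a minimal PyTorch sketch of that data flow, assuming a "prepend-and-discard" composition: each layer prepends its own trainable prefix to the incoming states, and the hidden states produced at the prefix positions are dropped before the next layer prepends its own prefix. All names here (PrefixedEncoder, PREFIX_LEN, D_MODEL) are illustrative assumptions, not a real library API.

```python
import torch
import torch.nn as nn

PREFIX_LEN, D_MODEL, N_LAYERS = 4, 64, 3

class PrefixedEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        # One trainable prefix per layer (the frozen base model is not shown).
        self.prefixes = nn.ParameterList(
            nn.Parameter(torch.randn(PREFIX_LEN, D_MODEL)) for _ in range(N_LAYERS)
        )
        self.layers = nn.ModuleList(
            nn.TransformerEncoderLayer(D_MODEL, nhead=4, batch_first=True)
            for _ in range(N_LAYERS)
        )

    def forward(self, x):  # x: (batch, seq_len, D_MODEL)
        batch = x.size(0)
        for prefix, layer in zip(self.prefixes, self.layers):
            # Prepend this layer's own prefix to the sequence representation
            # coming from the previous layer.
            p = prefix.unsqueeze(0).expand(batch, -1, -1)
            h = layer(torch.cat([p, x], dim=1))
            # Discard the outputs at the prefix positions: only the original
            # token positions are passed on, and layer L+1 then prepends its
            # own layer-specific prefix to them.
            x = h[:, PREFIX_LEN:, :]
        return x

model = PrefixedEncoder()
out = model(torch.randn(2, 10, D_MODEL))
print(out.shape)  # torch.Size([2, 10, 64])
```

Under this assumption, the input to layer L+1 is the prefix of layer L+1 concatenated with the outputs of layer L at the original token positions only; layer L's prefix outputs never propagate forward.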
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A single layer in a multi-layer model has been adapted for a tuning method where a set of trainable vectors (a 'prefix') is used. Arrange the following steps to accurately describe the complete data flow from the moment data enters this single layer until it is passed to the next.
Multi-Layer Input Composition in Prefix-Tuning