In a Transformer architecture modified for prefix-tuning, the hidden states computed at the prefix positions are not carried forward: after each layer processes the combined sequence, only the hidden states corresponding to the original input are passed to the subsequent layer, while that next layer receives its own set of trainable prefix vectors. This is how the model has access to the learned task-specific information at every stage without letting layer-computed prefix states overwrite the learned per-layer prefixes.
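To make this data flow concrete, below is a minimal PyTorch-style sketch (not taken from the course; the names `prefix_tuned_forward`, `layers`, and `prefixes` are illustrative, and each layer is assumed to be a callable mapping a (seq_len, d_model) tensor to a tensor of the same shape). It shows each layer receiving its own trainable prefix and the output-selection step that keeps only the states for the original tokens.

```python
import torch

def prefix_tuned_forward(layers, prefixes, hidden_states):
    """Illustrative deep prefix-tuning data flow.

    layers:        frozen Transformer layers, each (seq_len, d_model) -> (seq_len, d_model)
    prefixes:      one trainable tensor per layer, each of shape (prefix_len, d_model)
    hidden_states: embeddings of the real input tokens, shape (input_len, d_model)
    """
    for layer, prefix in zip(layers, prefixes):
        prefix_len = prefix.shape[0]
        # Prepend this layer's own trainable prefix vectors to the real hidden states.
        combined = torch.cat([prefix, hidden_states], dim=0)
        out = layer(combined)
        # Output selection: discard the states computed at the prefix positions;
        # only the states for the original tokens flow to the next layer,
        # which will be given its own learned prefix instead.
        hidden_states = out[prefix_len:]
    return hidden_states

# Toy usage: two frozen "layers" (identity placeholders) and per-layer prefixes.
layers = [torch.nn.Identity(), torch.nn.Identity()]
prefixes = [torch.randn(4, 16, requires_grad=True) for _ in layers]
x = torch.randn(10, 16)   # 10 real input tokens, d_model = 16
y = prefix_tuned_forward(layers, prefixes, x)
print(y.shape)            # torch.Size([10, 16]) -- same length as the original input
```

One design consequence visible in the sketch: because the prefix states are re-injected from trainable parameters at every layer rather than propagated, the output sequence length always matches the original input, and the frozen layer weights never need to change.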
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Output Selection Formula in a Prefix-Tuned Transformer Layer
Inter-Layer Data Flow in Prefix-Tuning
Consequences of Output Selection in a Modified Transformer
In a Transformer layer adapted for prefix-tuning, the input consists of a set of trainable prefix vectors followed by the hidden states from the original input sequence. After this combined input is processed by the layer, the resulting hidden states corresponding to the prefix vectors are discarded, and only the states for the original sequence are passed on. What is the most critical reason for this selective output process?