Learn Before
Improving Soft Prompt Cohesion
Based on the team's problem, what specific architectural enhancement could they add to their prompting method to encourage the model to interpret the soft prompt as a single, cohesive instruction? Describe the enhancement and explain the reasoning behind your suggestion.
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A machine learning team is developing a system where a task is specified to a large language model using a sequence of learnable numerical vectors instead of natural language text. They are considering an additional step: before passing these vectors to the main model, they first process the entire sequence of vectors through a separate, smaller neural network. The output of this smaller network is then used as the actual prompt. What is the most likely trade-off the team is making by adding this extra processing step?
A system uses a dedicated sequence model to process a soft prompt before it is used by a main large language model. Arrange the following steps in the correct chronological order of data flow.
Improving Soft Prompt Cohesion
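The data flow described in the related questions above (learnable vectors, then a separate smaller network, then the main model) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not a definitive implementation: the simple recurrent encoder, the dimensions, the random weights, and the names `encode_prompt` and `build_model_input` are all hypothetical, and the main model itself is left out.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Return a rows x cols matrix of small random weights (illustrative values)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def encode_prompt(raw_prompt, w_in, w_rec):
    """Hypothetical prompt encoder: one recurrent pass over the soft prompt.

    Each output vector depends on the vectors before it, so the prompt
    positions are no longer learned independently of one another.
    """
    hidden = [0.0] * len(w_rec)
    encoded = []
    for vec in raw_prompt:
        pre = [a + b for a, b in zip(matvec(w_in, vec), matvec(w_rec, hidden))]
        hidden = [math.tanh(p) for p in pre]
        encoded.append(hidden)
    return encoded


def build_model_input(encoded_prompt, token_embeddings):
    """Prepend the encoded prompt to the embedded user input."""
    return encoded_prompt + token_embeddings


rng = random.Random(0)
dim, prompt_len = 4, 3
raw_prompt = make_matrix(prompt_len, dim, rng)  # learnable vectors (assumed values)
w_in = make_matrix(dim, dim, rng)               # encoder input weights (assumed)
w_rec = make_matrix(dim, dim, rng)              # encoder recurrent weights (assumed)
token_embeddings = make_matrix(2, dim, rng)     # stand-in for embedded user input

encoded = encode_prompt(raw_prompt, w_in, w_rec)
model_input = build_model_input(encoded, token_embeddings)
print(len(model_input), len(model_input[0]))  # 5 4
```

In this sketch the encoder's weights would be trained along with the raw prompt vectors, which is the extra parameter and compute cost that the trade-off question refers to.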