Learn Before
A team is implementing a reinforcement learning-based system to generate optimized prompts. The system consists of a base large language model (LLM) and a smaller, trainable adaptor network that functions as the policy network. Arrange the following steps to describe a single iteration of the training loop for this system.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A research team is developing a system to automatically generate effective prompts for a specific task. They integrate a small, trainable network module with a very large, pre-trained language model. During the training process, they only update the parameters of this small module, keeping the original large model's parameters unchanged. The training is guided by rewards from a separate evaluation model that assesses the quality of the generated prompts. Which of the following best analyzes the primary advantage of this training approach?
A team is implementing a reinforcement learning-based system to generate optimized prompts. The system consists of a base large language model (LLM) and a smaller, trainable adaptor network that functions as the policy network. Arrange the following steps to describe a single iteration of the training loop for this system.
Troubleshooting a Prompt Generation System