Learn Before
Example of a Prompt Template for Sample Generation in Self-Instruct
In the Self-Instruct process, a prompt template for generating a new sample gives the LLM a few existing samples as demonstrations and instructs it to produce a new one consisting of an instruction, a user input, and an output. Each labeled slot in the template is filled with the corresponding field of an existing sample, and the final slot is left open for the model to complete. An example template is: "You are provided with a set of input-output samples, each composed of an instruction, a user input, and an output. Please generate a new sample based on these. Sample 1: Input: Output: Sample 2: Input: Output: New Sample: ".
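As a rough illustration, the template could be filled in programmatically along the following lines. This is a minimal sketch, not a reference implementation: the `Sample` class, the `build_prompt` function, and the seed samples are hypothetical names and data introduced here, and placing each instruction directly after its "Sample i:" label is an assumption about how the slots are meant to be filled.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One demonstration: an instruction, a user input, and an output."""
    instruction: str
    user_input: str
    output: str


# Fixed preamble, copied from the example template.
HEADER = (
    "You are provided with a set of input-output samples, each composed of "
    "an instruction, a user input, and an output. Please generate a new "
    "sample based on these."
)


def build_prompt(samples):
    """Fill the template with existing samples and leave the last slot open."""
    lines = [HEADER]
    for i, s in enumerate(samples, start=1):
        # Assumption: the instruction follows the "Sample i:" label.
        lines.append(f"Sample {i}: {s.instruction}")
        lines.append(f"Input: {s.user_input}")
        lines.append(f"Output: {s.output}")
    # The model is expected to continue from here with a new sample.
    lines.append("New Sample:")
    return "\n".join(lines)


# Hypothetical seed samples, for illustration only.
seeds = [
    Sample("Translate the sentence into French.", "Good morning.", "Bonjour."),
    Sample("Classify the sentiment of the review.", "I loved this film.", "Positive"),
]

prompt = build_prompt(seeds)
print(prompt)
```

The resulting string would be sent to the LLM, whose completion after "New Sample:" is then parsed into a new instruction, input, and output.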
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Filtering in Self-Instruct
In an automated process for generating training data, a language model has just created a new, unique instruction: 'Write a product description for a fictional gadget.' To complete the data instance for this instruction, what is the essential next task for the model?
An automated system for creating training data has just generated a new instruction: 'Summarize the provided text into a single sentence.' In the subsequent step, the system produces the following text: 'The main character overcomes several obstacles to achieve their lifelong dream.' Based on the requirements for creating a complete data instance, what crucial component is missing from this generated sample?
Diagnosing a Flaw in an Automated Data Generation Process
Learn After
A research team is using an automated process to generate new training data. The process involves showing a language model a few high-quality examples, where each example consists of an 'instruction', a user 'input', and a corresponding 'output'. The goal is for the model to then create a completely new, well-formed example that follows the same three-part structure. Which of the following prompts, given to the model after the examples, would be most effective and precise for this task?
Troubleshooting an Automated Data Generation Process
Evaluating a Prompt Template for Data Generation