Learn Before
Formatting a Training Sample for a Text-to-Text Model
A data scientist is preparing a dataset to train a text-to-text model for a question-answering task. One of the data points is the question 'What is the chemical symbol for gold?' and its corresponding answer 'Au'. Construct the complete, single-line training sample for this data point, following the specific format required by the model. Your answer should include the standard prefix for this task, the question, the separator, the start-of-sequence token, and the answer.
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A researcher is preparing data for a unified text-to-text model. To have the model perform a question-answering task, the input data must be structured as a single text string that begins with a specific instruction, followed by the question itself. Based on this requirement, which of the following examples is formatted correctly to serve as the model's input?
Formatting a Training Sample for a Text-to-Text Model
Analyzing an Incorrect T5 Question-Answering Sample