Learn Before
Debugging a Text-to-Text Model's Training Data
A data science team is training a text-to-text model with the goal of simplifying complex sentences. However, they find the model is generating very short, high-level summaries instead of rephrasing the original sentences in simpler terms. Analyze the following training data sample and identify the specific error in its format that is causing this incorrect behavior. Explain your reasoning and suggest a correction.
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A researcher is preparing a dataset to train a text-to-text model for simplifying complex sentences. The goal is to make the text easier to understand. Given the complex sentence 'The meteorological forecast predicts significant precipitation' and its simplified version 'It will rain a lot', which of the following options correctly formats this pair as a single training sample for the model?
Constructing a Text Simplification Training Sample
Debugging a Text-to-Text Model's Training Data