Learn Before
A language model is being trained on the sequence: ⟨s⟩ Translate to Spanish: The cat sat. El gato se sentó. ⟨/s⟩. To effectively teach the model how to perform the translation, on which part of the sequence should the training loss be calculated?
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is being trained on the sequence:
⟨s⟩ Translate to Spanish: The cat sat. El gato se sentó. ⟨/s⟩. To effectively teach the model how to perform the translation, on which part of the sequence should the training loss be calculated?Debugging a Chatbot Training Process
Rationale for Sub-sequence Loss Calculation