Analyzing a Language Model's Training Method
Based on the training approach described in the case study, what is the primary purpose of intentionally replacing a correct word (like 'meal') with a random, incorrect word (like 'bicycle')? Explain how this helps the model learn.
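The corruption step the question refers to can be sketched as follows. This is a minimal illustration under stated assumptions, not the case study's implementation: the example sentence, the toy vocabulary, and the function name `replace_with_random_token` are all hypothetical, and excluding the original word from the random draw is a simplification made here so the swap is always visible.

```python
import random


def replace_with_random_token(tokens, position, vocab, rng):
    """Return (corrupted_tokens, original_token).

    The token at `position` is swapped for a randomly chosen vocabulary
    word. Assumption: the original word is excluded from the draw so the
    replacement is always different (a simplification for illustration).
    """
    original = tokens[position]
    candidates = [word for word in vocab if word != original]
    corrupted = list(tokens)  # copy, so the input sequence is left untouched
    corrupted[position] = rng.choice(candidates)
    return corrupted, original


# Hypothetical example sentence and toy vocabulary (not from the case study).
tokens = "[CLS] I enjoyed the meal . [SEP]".split()
vocab = ["meal", "bicycle", "river", "window", "the", "enjoyed"]

rng = random.Random(0)  # seeded only to make the sketch repeatable
position = tokens.index("meal")
corrupted, target = replace_with_random_token(tokens, position, vocab, rng)

print("model input:      ", corrupted)
print("prediction target:", target)
```

In this sketch the model input contains the wrong word at the chosen position, while the original word is kept aside as the prediction target for that position.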
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is being trained using a technique where some words in the input are altered to help the model learn. Consider the original input sequence:
[CLS] My dog chased the ball . [SEP] He brought it back . [SEP]
If the token 'ball' is selected to be replaced by a random word from the model's vocabulary, which of the following represents the most likely resulting sequence?
Analyzing a Language Model's Training Method
Analyzing a Corrupted Input Sequence