Learn Before
Self-Reinforcing Training Strategy for a Chatbot
Based on the training methodology described in the case study, evaluate one major advantage and one significant disadvantage of that methodology. Explain your reasoning for each.
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Log-Probability Loss with Model-Generated Target
A research team is training a generative model using a method where the learning target for any given input is the output that the model itself currently calculates as having the highest probability. This self-generated target is then used to update the model's parameters. Which statement best analyzes a key implication of this training approach?
Self-Reinforcing Training Strategy for a Chatbot
Contrasting Learning Target Methodologies