Learn Before
Contrasting Environments in Learning Systems
Consider a learning agent designed to play a chess game against an opponent versus a learning agent designed to generate helpful summaries of long documents. Contrast the nature of the 'environment' for each of these two agents. In your answer, explain how the environment provides feedback and influences the learning process in both scenarios.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
An AI company is training a language model to be a helpful and harmless assistant. When the model generates a response to a user prompt, a separate, pre-trained 'preference model' scores the response based on its helpfulness and harmlessness. This score is then used to update the language model's parameters. In this training setup, which component best represents the 'environment' for the language model agent?
Contrasting Environments in Learning Systems
Deconstructing an LLM's Learning Framework