Short Answer

The Role of Parameters in an LLM Policy

When a large language model is trained using reinforcement learning, the model itself is often referred to as the 'policy'. In this context, what are the specific, adjustable components that constitute the 'parameters' of this policy, and what is their fundamental role during the training process?
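As a point of reference for the question, here is a minimal sketch of a policy whose parameters are adjusted by a reinforcement learning update. It is a toy stand-in, not an actual LLM: the "policy" is a softmax over three hypothetical tokens, its only parameters are the three logits in theta, the reward function is invented for illustration, and the update rule is plain REINFORCE, which is an assumption since the question does not name a specific algorithm.

```python
import math
import random


def softmax(logits):
    """Turn raw parameters (logits) into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_step(theta, reward_fn, lr, rng):
    """Sample one action from the policy and nudge theta toward rewarded actions."""
    probs = softmax(theta)
    action = rng.choices(range(len(theta)), weights=probs)[0]
    reward = reward_fn(action)
    # Gradient of log pi(action) with respect to theta[k] is 1[k == action] - probs[k].
    return [
        t + lr * reward * ((1.0 if k == action else 0.0) - probs[k])
        for k, t in enumerate(theta)
    ]


def train(steps=500, lr=0.5, seed=0):
    """Run repeated updates; theta is the only thing that changes between steps."""
    rng = random.Random(seed)
    theta = [0.0, 0.0, 0.0]  # the policy's parameters (hypothetical 3-token vocabulary)

    def reward_fn(action):
        # Hypothetical reward: only token 2 is considered a good output.
        return 1.0 if action == 2 else 0.0

    for _ in range(steps):
        theta = reinforce_step(theta, reward_fn, lr, rng)
    return theta


if __name__ == "__main__":
    final_theta = train()
    print("parameters:", final_theta)
    print("policy probabilities:", softmax(final_theta))
```

In a real LLM the list theta would be replaced by the network's weights and biases, and the gradient would be computed by backpropagation rather than written out by hand, but the role of the parameters is the same: they determine the output distribution, and training changes them so that higher-reward outputs become more likely.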

Updated 2025-10-03

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Comprehension in Revised Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science