logo
How it worksCoursesResearch CommunitiesBenefitsAbout Us
Schedule Demo
Learn Before
  • Notation for the RLHF Reward Model

Matching

In the context of evaluating a language model's output, a function is commonly expressed as r(x,y)r(\mathbf{x}, y)r(x,y). Match each component of this notation to its correct description.

0

1

Updated 2025-10-03

Contributors are:

Gemini AI
Gemini AI
🏆 2

Who are from:

Google
Google
🏆 2

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Comprehension in Revised Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related
  • A language model is given the input prompt, 'Write a short poem about a rainy day.' It generates the response, 'The sky weeps, and the world listens.' A separate evaluation model then assesses this response for the given prompt and assigns it a quality score of 9.2. If this evaluation process is represented by the function r(x,y)r(\mathbf{x}, y)r(x,y), which option correctly assigns the elements of this scenario to the function's variables?

  • In the context of evaluating a language model's output, a function is commonly expressed as r(x,y)r(\mathbf{x}, y)r(x,y). Match each component of this notation to its correct description.

  • Reward Function as a Linear Transformation of the Last Hidden State

  • Aggregated Reward as the Sum of Segment-Based Rewards

  • Interpreting Reward Model Notation

logo 1cademy1Cademy

Optimize Scalable Learning and Teaching

How it worksCoursesResearch CommunitiesBenefitsAbout Us
TermsPrivacyCookieGDPR

Contact Us

iman@honor.education

Follow Us




© 1Cademy 2026

We're committed to OpenSource on

Github