Learn Before
Challenges of Rating LLM Outputs
Having annotators assign numerical scores to Large Language Model (LLM) outputs is difficult. It is hard to design an annotation standard for numerical ratings that every annotator can easily follow and agree on, so the scores that different annotators give to the same output are often inconsistent.
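The inconsistency described above can be made concrete with a small sketch. The ratings below are invented for illustration, and the function names `exact_agreement` and `pairwise_agreement` are hypothetical helpers, not part of any library. The sketch assumes two annotators rate the same five outputs on a 1-5 scale and compares how often their scores match exactly with how often they order pairs of outputs the same way.

```python
from itertools import combinations

# Hypothetical 1-5 ratings from two annotators for the same five outputs.
# These numbers are made up for illustration only.
annotator_a = [4, 5, 2, 3, 1]
annotator_b = [3, 4, 1, 2, 1]


def exact_agreement(a, b):
    """Fraction of outputs on which both annotators gave the same score."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def pairwise_agreement(a, b):
    """Fraction of output pairs that both annotators order the same way.

    A pair counts as agreement only if both annotators prefer the same
    output, or both rate the two outputs as tied.
    """
    pairs = list(combinations(range(len(a)), 2))
    same = sum(
        (a[i] > a[j]) == (b[i] > b[j]) and (a[i] < a[j]) == (b[i] < b[j])
        for i, j in pairs
    )
    return same / len(pairs)


if __name__ == "__main__":
    print("exact score agreement:", exact_agreement(annotator_a, annotator_b))
    print("pairwise order agreement:", pairwise_agreement(annotator_a, annotator_b))
```

In this invented example the two annotators give the same score to only one of the five outputs, yet they order nine of the ten output pairs the same way. That pattern is what one would expect if the annotators read the numerical scale differently while largely agreeing on which outputs are better; real annotation data may of course look different.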
Tags
Foundations of Large Language Models
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
A development team has a pre-trained language model and wants to fine-tune it to produce responses that are more helpful and safe. Their strategy involves first creating a separate model whose sole job is to score how good a given response is, based on human preferences. Which of the following best describes the data and objective used to train this specific 'scoring' model?
You are tasked with aligning a large language model to better follow human preferences using a reward-based approach. Arrange the following high-level stages of the process into the correct chronological order.
Diagnosing Reward Model Failure
Rating LLM Outputs for Reward Models