Multiple Choice

A language model is being fine-tuned on a dataset of customer support chat logs to improve its ability to generate helpful responses. The training process is guided by the objective function: θ~=argmaxθ^+(query,response)DlogPrθ^+(responsequery)\tilde{\theta} = \arg \max_{\hat{\theta}^{+}} \sum_{(\text{query},\text{response})\in\mathcal{D}} \log \mathrm{Pr}_{\hat{\theta}^{+}}(\text{response}|\text{query}) During one step of this process, the model processes a single (query, response) pair from the dataset. What is the role of the specific component log Pr(response|query) for this single pair?

0

1

Updated 2025-09-28

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.4 Alignment - Foundations of Large Language Models

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science