Learn Before
Analyzing Model Update Notation
Analyze the junior developer's report. Is their new notation, $Pr_{\theta_{new}}^{greedy}(\cdot)$, an accurate representation of the change they made? Explain your reasoning, and if you disagree, provide the correct notation.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A researcher is comparing two language models. Model A is defined by a set of parameters . Model B is a version of Model A that has been fine-tuned on a new dataset, resulting in a new set of parameters, . The researcher wants to compare the probability of each model generating the word 'innovative' given the same input context and using the same sampling strategy, . Which of the following mathematical expressions accurately represents this comparison?
An AI engineer is working with a pre-trained Large Language Model, whose probability distribution is represented by . The engineer decides to change the method used to select the next word from the model's output probabilities, switching from a greedy approach to a top-k sampling approach. The model's underlying weights and biases are not modified. Which component of the notation would need to be updated to reflect this change?
Policy Notation for Autoregressive Models (π_θ)
Analyzing Model Update Notation