Learn Before
Interpreting Model Notation in a Research Context
A colleague is reading a research paper abstract that states: 'We introduce a novel neural network for a task. The model is trained to maximize the conditional probability p(y|x), where x is the input and y is the output.' Based on this, your colleague concludes that the model must be non-parametric (i.e., has no learnable parameters) because the probability expression does not explicitly list any parameters. Evaluate your colleague's conclusion and justify your reasoning.
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A research paper on language models presents the probability of an output token
ygiven an input contextxin two different ways:Expression 1:
p(y | x; W, θ)Expression 2:p(y | x)Assuming both expressions refer to the same underlying model where
Wandθare the model's parameters, what is the most accurate interpretation of the relationship between them?Interpreting Model Notation in a Research Context
In the context of parameterized machine learning models, the mathematical expression
p(y|x)indicates that the probability of outputygiven inputxis calculated without relying on any learned model parameters.