True/False

In the context of learning a compressed representation of a long text, consider the optimization formula: hat(σ) = argmin_σ s(hat(y), hat(y)_σ), where hat(y) is the prediction from the full text and hat(y)_σ is the prediction from the compressed representation σ. If the function s(·,·) were changed from a dissimilarity measure (e.g., a loss function) to a similarity measure (e.g., a cosine similarity score), the argmin operator should be replaced with argmax to correctly identify the optimal compressed representation hat(σ).

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science