1Cademy

Learn Before

Formula for Optimizing Soft Prompts via Context Compression

True/False

In the context of learning a compressed representation of a long text, consider the optimization formula: `hat(σ) = argmin_σ s(hat(y), hat(y)_σ)`, where `hat(y)` is the prediction from the full text and `hat(y)_σ` is the prediction from the compressed representation `σ`. If the function `s(·,·)` were changed from a dissimilarity measure (e.g., a loss function) to a similarity measure (e.g., a cosine similarity score), the `argmin` operator should be replaced with `argmax` to correctly identify t
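The relationship between the measure and the operator can be checked on a toy case. The sketch below is illustrative only: the vectors, the candidate names (`sigma_a`, `sigma_b`, `sigma_c`), and the choice of cosine distance as the dissimilarity are assumptions made for the example, not part of the original formula. It searches over a small discrete set of candidates rather than optimizing a continuous `σ`.

```python
import math

def cosine_similarity(a, b):
    """Similarity measure: higher means the two predictions agree more."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def cosine_distance(a, b):
    """Dissimilarity measure: lower means the two predictions agree more."""
    return 1.0 - cosine_similarity(a, b)

# Hypothetical prediction from the full text, hat(y).
y_full = [0.7, 0.2, 0.1]

# Hypothetical predictions hat(y)_σ for three candidate compressed representations.
candidates = {
    "sigma_a": [0.1, 0.2, 0.7],
    "sigma_b": [0.6, 0.3, 0.1],
    "sigma_c": [0.3, 0.4, 0.3],
}

# s is a dissimilarity: use argmin.
best_by_min_distance = min(
    candidates, key=lambda k: cosine_distance(y_full, candidates[k])
)

# s is a similarity: use argmax.
best_by_max_similarity = max(
    candidates, key=lambda k: cosine_similarity(y_full, candidates[k])
)

# Keeping argmin with a similarity picks the candidate that agrees least.
wrong_by_min_similarity = min(
    candidates, key=lambda k: cosine_similarity(y_full, candidates[k])
)

print(best_by_min_distance, best_by_max_similarity, wrong_by_min_similarity)
```

In this toy setup, minimizing the distance and maximizing the similarity select the same candidate, while minimizing the similarity selects the candidate whose prediction differs most from `hat(y)`.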

Updated 2025-10-08
