True/False

When the supervised fine-tuning objective is written as $\tilde{\theta} = \arg\max_{\theta} \sum_{(\mathbf{x},\mathbf{y})\in\mathcal{D}} \log \mathrm{Pr}_{\theta}(\mathbf{y}|\mathbf{x})$, the parameters denoted by $\theta$ are typically initialized from a random distribution before the optimization process begins.

0 (False)

1 (True)
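
For context, here is a minimal PyTorch sketch of this objective, assuming the Hugging Face `transformers` API; the model name and the $(\mathbf{x},\mathbf{y})$ pair are hypothetical. The point the question probes is visible in the first two calls: in supervised fine-tuning, $\theta$ is loaded from a pre-trained checkpoint, not drawn from a random distribution (random initialization is where pre-training itself starts, not fine-tuning).

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# theta is restored from a *pre-trained* checkpoint (hypothetical choice of gpt2),
# not randomly initialized.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# One hypothetical (x, y) pair from the fine-tuning dataset D.
prompt, response = "Translate to French: cat ->", " chat"
x_ids = tokenizer(prompt, return_tensors="pt").input_ids
y_ids = tokenizer(response, return_tensors="pt").input_ids

input_ids = torch.cat([x_ids, y_ids], dim=1)
labels = input_ids.clone()
labels[:, : x_ids.size(1)] = -100  # mask x: loss covers only -log Pr(y | x)

# Minimizing this NLL is equivalent to maximizing sum log Pr_theta(y | x).
loss = model(input_ids=input_ids, labels=labels).loss
loss.backward()
```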

Updated 2025-10-02

Tags: Ch.4 Alignment - Foundations of Large Language Models; Foundations of Large Language Models; Computing Sciences; Foundations of Large Language Models Course; Comprehension in Revised Bloom's Taxonomy; Cognitive Psychology; Psychology; Social Science; Empirical Science; Science