Learn Before
Concept

Loss Function for RNN

Given the predictions for each time step y^(t)\hat{y}^{(t)}, together with the ground truth labels y(t)y^{(t)}, we can calculate the loss on step t J(t)=L(y^(t),y(t))J^{(t)} = \mathcal{L} (\hat{y}^{(t)}, y^{(t)}) To get the overall loss, we only need to average these: J=1Tt=1TJ(t)J = \frac{1}{T} \sum_{t=1}^T J^{(t)}

Image 0

0

1

Updated 2026-05-02

Tags

Data Science

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related