Formula

Connectionist Temporal Classification Loss Function

A Connectionist Temporal Classification (CTC) based automatic speech recognition system uses a negative log-likelihood loss with a special CTC loss function:

LCTC=(X,Y)DlogPCTC(YX)L_{CTC} = \sum_{(X,Y) \in D} -\log P_{CTC}(Y \mid X)

Where LCTCL_{CTC} is the loss for the entire dataset DD.

PCTC=AB1(Y)t=1Tp(atht)P_{CTC} = \sum_{A\in B^{-1}(Y)} \prod_{t=1}^T p(a_t \mid h_t)

Where PCTCP_{CTC} is the probability of output YY given input XX, and it is the highest sum of probability over all its assignments.

0

1

Updated 2026-05-18

Tags

Data Science