Concept

Connectionist Temporal Classification Training

A CTC-based automatic speech recognition (ASR) system is trained by minimizing the negative log-likelihood of the target sequences, where the likelihood of each target is computed with the CTC formulation.

$$L_{CTC} = \sum_{(X,Y) \in D} -\log P_{CTC}(Y \mid X)$$

$L_{CTC}$ - loss for the entire dataset $D$

$$P_{CTC}(Y \mid X) = \sum_{A \in B^{-1}(Y)} \prod_{t=1}^{T} p(a_t \mid h_t)$$

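$P_{CTC}(Y \mid X)$ - probability of the output $Y$ given the input $X$. It is the sum, not the maximum, of the probabilities of all alignments $A$ of $Y$, that is, all frame-level label sequences in $B^{-1}(Y)$. Here $B$ maps an alignment to a label sequence by merging repeated labels and then removing blanks, $a_t$ is the label assigned to frame $t$, and $h_t$ is the encoder output at frame $t$.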
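
The two formulas above can be checked on a toy input by enumerating every alignment directly. The following is a minimal sketch, not a production implementation: the names `collapse`, `ctc_prob`, `ctc_loss`, and `BLANK` are hypothetical, the blank symbol is assumed to have index 0, and `frame_probs[t][k]` stands in for $p(a_t = k \mid h_t)$.

```python
import itertools
import math

BLANK = 0  # assumption: index 0 is the CTC blank symbol


def collapse(alignment):
    """The map B: merge repeated labels, then drop blanks."""
    merged = [label for label, _ in itertools.groupby(alignment)]
    return tuple(label for label in merged if label != BLANK)


def ctc_prob(frame_probs, target):
    """P_CTC(Y | X): sum over every alignment A with B(A) == Y.

    frame_probs[t][k] plays the role of p(a_t = k | h_t).
    Enumeration costs V**T, so this is only usable on tiny examples.
    """
    num_frames = len(frame_probs)
    vocab_size = len(frame_probs[0])
    total = 0.0
    for alignment in itertools.product(range(vocab_size), repeat=num_frames):
        if collapse(alignment) == tuple(target):
            # Product over frames of the per-frame label probabilities.
            total += math.prod(frame_probs[t][a] for t, a in enumerate(alignment))
    return total


def ctc_loss(dataset):
    """L_CTC: sum of -log P_CTC(Y | X) over (frame_probs, target) pairs."""
    return sum(-math.log(ctc_prob(probs, target)) for probs, target in dataset)


# Two frames, vocabulary {blank, label 1}, target sequence (1,).
probs = [[0.4, 0.6], [0.3, 0.7]]
p = ctc_prob(probs, (1,))
loss = ctc_loss([(probs, (1,))])
```

In this example three alignments collapse to the target: (1, 1), (1, 0), and (0, 1), with probabilities 0.42, 0.18, and 0.28. Practical CTC implementations avoid the exponential enumeration by computing the same sum with dynamic programming.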

Updated 2022-05-22

Tags

Data Science