Learn Before
Concept

Automatic Speech Recognition Architecture

The architecture for ASR is the encoder-decoder which is implemented with recurrent neural networks (LSTMs) or transformers. For this architecture, it is common to begin with log mel spectral features and map to letters. It’s also possible to sometimes map to induced morpheme-like chunks, such as word pieces or BPE (byte-pair encoding).

0

1

Updated 2022-05-08

Tags

Deep Learning (in Machine learning)

Data Science

Learn After