Learn Before
Concept

Spectrogram Prediction

The exact same architecture we described for ASR—the encoder-decoder with attention – can be used for the first component of TTS.

The Tacotron2 architecture is an encoder-decoder that maps from graphmes to spectrograms, followed by a vocoder that maps to wavefiles.

Image 0

0

0

Updated 2022-05-15

Tags

Data (Information)

Data Science

Learn After