Learn Before
Concept
Spectrogram Prediction
The exact same architecture we described for ASR—the encoder-decoder with attention – can be used for the first component of TTS.
The Tacotron2 architecture is an encoder-decoder that maps from graphmes to spectrograms, followed by a vocoder that maps to wavefiles.

0
0
Updated 2022-05-15
Tags
Data (Information)
Data Science