Concept

Text to Speech

Text to speech (TTS), or speech synthesis, is the inverse problem of automatic speech recognition. It is a sequence-to-sequence learning task in which the input is a sequence of text, such as characters or phonemes, and the output is generated audio. Because audio is represented as a dense sequence of samples or acoustic frames, the output sequence is typically much longer than the input sequence.
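To make the length mismatch concrete, here is a minimal sketch that compares the number of input characters with the number of output audio samples and spectrogram-style frames. The function name `length_ratio`, the 22,050 Hz sample rate, the hop length of 256 samples, and the utterance duration are all illustrative assumptions, not values taken from any particular TTS system.

```python
def length_ratio(text: str, duration_s: float,
                 sample_rate: int = 22_050, hop_length: int = 256) -> dict:
    """Compare input length (characters) with output length (audio).

    sample_rate and hop_length are assumed, illustrative defaults.
    """
    n_chars = len(text)                               # input sequence length
    n_samples = int(round(duration_s * sample_rate))  # raw waveform length
    n_frames = n_samples // hop_length + 1            # frame count at this hop
    return {
        "chars": n_chars,
        "samples": n_samples,
        "frames": n_frames,
        "samples_per_char": n_samples / n_chars if n_chars else float("inf"),
    }


if __name__ == "__main__":
    # Hypothetical utterance: assume the phrase takes about one second to speak.
    print(length_ratio("hello world", duration_s=1.0))
```

Under these assumptions, even a short phrase maps to tens of frames and thousands of waveform samples, which is why many TTS systems predict an intermediate frame-level representation before producing the waveform.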

Updated 2026-05-01

Tags

Data Science

D2L

Dive into Deep Learning @ D2L

Related