Learn Before
Concept

Speaker diarization

Speaker diarization is the task of determining 'who spoke when' in a long multi-speaker audio recording, marking the start and end of each speaker's turns in the interaction. It can be useful for transcribing meetings, classroom speech, or medical interactions. Often times, diarization systems use voice activity detection to find segments of continuous speech, extract speaker embedding vectors, and cluster the vectors to group together segments likely from the same speaker.

0

1

Updated 2022-05-15

Tags

Deep Learning (in Machine learning)

Data Science