Speaker diarization: the foundational layer of conversational AI - AI Engineer Paris 2025
Author: Koyeb
Uploaded: 2025-10-23
Views: 1049
Description:
AI Engineer Paris 2025 → https://www.ai.engineer/paris
Before LLMs, before speech-to-text, speaker diarization is the foundational layer of conversational AI pipelines. Getting "who speaks when" wrong can lead to catastrophic errors further down the pipeline. From digital meeting notetakers to AI medical scribes, from AI video dubbing to podcast intelligence platforms, knowing "who said what" is often just as important as "what was said" in a conversation.
In this talk, Hervé will introduce what speaker diarization is, what it is not, and why this apparently simple machine learning problem has yet to be solved.
Speaker: Hervé Bredin, Co-founder and Chief Science Officer, pyannoteAI