EMNLP 25 Tutorial - Spoken Conversational Agents with LLMs
Author: Chao-Han Huck Yang
Uploaded: 2025-12-01
Views: 16
Description:
Slides: https://huckiyang.github.io/emnlp-25-...
Spoken conversational agents are converging toward voice-native LLMs. This tutorial distills the path from cascaded ASR/NLU to end-to-end, retrieval- and vision-grounded systems. We frame adaptation of text LLMs to audio, cross-modal alignment, and joint speech–text training; review datasets, metrics, and robustness across accents; and compare design choices (cascaded vs. E2E, post-ASR correction, streaming).
We link industrial experts to current open-domain and task-oriented agents, highlight reproducible baselines, and outline open problems in privacy, safety, and evaluation. Attendees leave with practical recipes and a clear systems-level roadmap.