Qwen3‑TTS Multi‑Speaker in ComfyUI | Voice Cloning, Voice Design, Overlaps & Background Audio
Автор: Vantage with AI
Загружено: 2026-01-28
Просмотров: 2254
Описание:
🎬 Qwen3‑TTS Multi‑Speaker Dialogue in ComfyUI
In this video, I demonstrate an advanced ComfyUI workflow built on top of Qwen3‑TTS that enables realistic multi‑speaker AI dialogue with professional audio control.
We start by exploring the Qwen3‑TTS model family, including Base, VoiceDesign, and CustomVoice variants, and then move into a custom ComfyUI node that transforms single‑prompt TTS into a full dialogue engine.
🔹 What this video covers:
Qwen3‑TTS model overview (1.7B & 0.6B variants)
Speaker creation using Voice Design (text‑based)
Speaker creation using Voice Cloning (audio‑based)
Reusable named speakers for long projects
Script‑based multi‑speaker dialogue generation
Natural timing with gaps, overlaps, and interruptions
Crossfades and volume control per speaker
Background music & ambience support
Audio ducking and cinematic mixing
Production‑ready audio output inside ComfyUI
This workflow is ideal for:
AI storytelling
Podcasts and narration
Cinematic dialogue scenes
Games and interactive content
Long‑form multi‑character conversations
Workflow Download Link
https://www.patreon.com/posts/qwen3-t...
Custom Node
https://github.com/vantagewithai/Vant...
Update if already installed, for new nodes to work
⚠️ This is not an official Qwen pipeline — it’s a production‑focused extension built on top of Qwen3‑TTS for real‑world audio workflows.
#Qwen3‑TTS, #MultiSpeakerTTS, #VoiceCloningAI, #TexttoSpeechWorkflow
Повторяем попытку...
Доступные форматы для скачивания:
Скачать видео
-
Информация по загрузке: