ycliper

Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
Скачать

Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat

Автор: AI Engineer

Загружено: 2025-06-27

Просмотров: 2385

Описание: The Gemini Live API GA is now powered by Google's best cost-effective thinking model Gemini 2.5 Flash. We will do a deep dive on the capabilities that the Gemini Live API combined with Pipecat unlock for devs with special focus on session management, turn detection, tool use (including async function calls), proactivity, multilinguality and integration with telephony and other infra. We will demo some of the more innovative capabilities. We will also talk through some customer use cases - especially how customers can use Pipecat to extend these realtime multimodal capabilities to client side applications such as customer support agents, gaming agents, tutoring agents etc. In addition, we also have an experimental version of the Live API powered by with Google's native audio offering that can be tried in an experimental capacity . This experimental model can communicate with seamless, emotive, steerable, multilingual dialogue and enhances use cases where more natural voices can be a big differentiator.

About Kwindla Kramer
Kwin works on large-scale WebRTC infrastructure at Daily. He is the originator of Pipecat, the widely used, open source, vendor neutral voice agent framework supported by NVIDIA, Google, AWS and used by hundreds of startups. Before co-fonding Daily, Kwin built the sci-fi user interfaces in Minority Report and Iron Man.

About Shrestha Basu Mallick
Shrestha Basu Mallick is Group Product Manager and product lead for Gemini API at Google DeepMind. Prior to this, Shrestha led product development for AI assistance across all Google coding surfaces. Shrestha’s first role in Alphabet was at X, the Moonshot Factory, as Head of Product for a materials discovery platform that has since graduated to become its own startup. Before Google, Shrestha has had various roles in product and strategy at Salesforce Einstein, McKinsey, and Docusign. Shrestha holds a PhD in Applied Physics from Stanford.

Recorded at the AI Engineer World's Fair in San Francisco. Stay up to date on our upcoming events and content by joining our newsletter here: https://www.ai.engineer/newsletter

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...
Milliseconds to Magic: Real‑Time Workflows using the Gemini Live API and Pipecat

Поделиться в:

Доступные форматы для скачивания:

Скачать видео

  • Информация по загрузке:

Скачать аудио

Похожие видео

Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily

Pipecat Cloud: Enterprise Voice Agents Built On Open Source - Kwindla Hultman Kramer, Daily

«Я выпускал код, который не понимаю, и уверен, что вы тоже» – Джейк Нейшнс, Netflix.

«Я выпускал код, который не понимаю, и уверен, что вы тоже» – Джейк Нейшнс, Netflix.

Manager Skill Rating Training

Manager Skill Rating Training

Gemini Live API FINALLY Breaks Realtime Session Limits! - And Other Gemini's Important Updates

Gemini Live API FINALLY Breaks Realtime Session Limits! - And Other Gemini's Important Updates

GraphRAG: союз графов знаний и RAG: Эмиль Эйфрем

GraphRAG: союз графов знаний и RAG: Эмиль Эйфрем

Reddit, FlutterFlow, & Alafia: Full Technical Deep Dives | NY AI Engineers Meetup

Reddit, FlutterFlow, & Alafia: Full Technical Deep Dives | NY AI Engineers Meetup

Не создавайте агентов, а развивайте навыки – Барри Чжан и Махеш Мураг, Anthropic

Не создавайте агентов, а развивайте навыки – Барри Чжан и Махеш Мураг, Anthropic

Realtime Conversational Video with Pipecat and Tavus — Chad Bailey and Brian Johnson, Daily & Tavus

Realtime Conversational Video with Pipecat and Tavus — Chad Bailey and Brian Johnson, Daily & Tavus

Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect

Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect

Building real-time voice applications with Live API

Building real-time voice applications with Live API

ЛУЧШАЯ БЕСПЛАТНАЯ НЕЙРОСЕТЬ Google, которой нет аналогов

ЛУЧШАЯ БЕСПЛАТНАЯ НЕЙРОСЕТЬ Google, которой нет аналогов

Gemini Live API: Get Started With Voice Agents & AI Apps

Gemini Live API: Get Started With Voice Agents & AI Apps

Build a Multimodal Live Streaming Agent with ADK

Build a Multimodal Live Streaming Agent with ADK

Традиционное машинное обучение мертво — суровая правда 😔

Традиционное машинное обучение мертво — суровая правда 😔

How Claude Code Works - Jared Zoneraich, PromptLayer

How Claude Code Works - Jared Zoneraich, PromptLayer

Why Agent Hype can fall short of reality – Joel Becker, METR

Why Agent Hype can fall short of reality – Joel Becker, METR

Master the Gemini API: A Node.js tutorial with real examples

Master the Gemini API: A Node.js tutorial with real examples

Jack Morris: Stuffing Context is not Memory, Updating Weights is

Jack Morris: Stuffing Context is not Memory, Updating Weights is

Gemini API versus Vertex AI API - What's the Difference?

Gemini API versus Vertex AI API - What's the Difference?

Full Workshop: Realtime Voice AI — Mark Backman, Daily

Full Workshop: Realtime Voice AI — Mark Backman, Daily

© 2025 ycliper. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]