Scaling Generative AI Inference with llm-d - DevConf.IN 2026
Author: DevConf
Uploaded: 2026-02-18
Views: 14
Description:
Title: Scaling Generative AI Inference with llm-d
Speaker(s): Dasharath Masirkar
---
Generative AI models are rapidly changing the landscape of application development, but deploying and serving these large models in production at scale presents significant challenges. llm-d is an open-source, Kubernetes-native distributed inference serving stack designed to address these complexities. This session will introduce developers to llm-d, demonstrating how it provides "well-lit paths" to serve large generative AI models with the fastest time-to-value and competitive performance across diverse hardware accelerators. Attendees will learn about llm-d's architecture, key features, and how to leverage its tested and benchmarked recipes for production deployments, focusing on practical applications and best practices.
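As a rough illustration of what "serving a generative AI model on Kubernetes" looks like from the application side (not taken from the talk or the llm-d documentation), the sketch below queries a self-hosted, OpenAI-compatible inference endpoint of the kind that vLLM-based serving stacks typically expose. The gateway URL and model name are placeholders, not a real llm-d deployment.

```python
# Minimal client-side sketch, assuming an OpenAI-compatible endpoint is
# reachable inside the cluster. Endpoint URL and model name are hypothetical.
from openai import OpenAI

client = OpenAI(
    base_url="http://inference-gateway.example.svc.cluster.local/v1",  # placeholder in-cluster gateway
    api_key="not-needed-for-local-serving",  # many self-hosted stacks ignore the key
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # placeholder model identifier
    messages=[
        {"role": "user", "content": "Summarize what Kubernetes-native inference serving means."}
    ],
    max_tokens=128,
)
print(response.choices[0].message.content)
```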
---
Full schedule, including slides and other resources:
https://pretalx.devconf.info/devconf-...