Yes you can run LLMs on Kubernetes | Cloud Native Denmark 2025 Aarhus
Author: Cloud Native Nordics
Uploaded: 2025-12-31
Views: 43
Description:
As LLMs become increasingly powerful and ubiquitous, the need to deploy and scale these models in production environments grows. However, the complexity of LLMs can make them challenging to run reliably and efficiently. In this talk, we'll explore how Kubernetes can be leveraged to run LLMs at scale. We'll cover the key considerations and best practices for packaging LLM inference services as containerized applications using popular OSS inference servers like TGI, vLLM, and Ollama, and deploying them on Kubernetes. This includes managing model weights, handling dynamic batching and scaling, implementing advanced traffic routing, and ensuring high availability and fault tolerance. Additionally, we'll discuss accelerator management and serving models across multiple hosts. By the end of this talk, attendees will have a comprehensive understanding of how to successfully run their LLMs on Kubernetes, unlocking the benefits of scalability, resilience, and DevOps-friendly deployments.
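To make the abstract's themes concrete, here is a minimal sketch of deploying a containerized vLLM inference server on Kubernetes, written with the official Kubernetes Python client. It is not material from the talk: the image tag, model name, probe timings, and resource values are illustrative assumptions, and a real deployment would also address weight caching, autoscaling, and routing.

```python
# A minimal sketch, assuming the official `kubernetes` Python client and the
# public vllm/vllm-openai container image. Model name and timings are
# illustrative, not from the talk.
from kubernetes import client, config

def build_vllm_deployment() -> client.V1Deployment:
    container = client.V1Container(
        name="vllm",
        image="vllm/vllm-openai:latest",  # pin a specific tag in production
        args=["--model", "mistralai/Mistral-7B-Instruct-v0.2"],  # hypothetical model
        ports=[client.V1ContainerPort(container_port=8000)],
        # Request one GPU; requires the NVIDIA device plugin on the cluster.
        resources=client.V1ResourceRequirements(limits={"nvidia.com/gpu": "1"}),
        # Gate traffic until the weights are loaded and the server reports healthy.
        readiness_probe=client.V1Probe(
            http_get=client.V1HTTPGetAction(path="/health", port=8000),
            initial_delay_seconds=60,  # weight loading can take minutes
            period_seconds=10,
        ),
    )
    template = client.V1PodTemplateSpec(
        metadata=client.V1ObjectMeta(labels={"app": "vllm"}),
        spec=client.V1PodSpec(containers=[container]),
    )
    return client.V1Deployment(
        api_version="apps/v1",
        kind="Deployment",
        metadata=client.V1ObjectMeta(name="vllm"),
        spec=client.V1DeploymentSpec(
            replicas=1,
            selector=client.V1LabelSelector(match_labels={"app": "vllm"}),
            template=template,
        ),
    )

if __name__ == "__main__":
    config.load_kube_config()  # authenticate via the local kubeconfig
    client.AppsV1Api().create_namespaced_deployment(
        namespace="default", body=build_vllm_deployment()
    )
```

In practice you would put a Service in front of the Deployment, scale replicas with an autoscaler keyed to GPU or request metrics, and route clients to the OpenAI-compatible API the server exposes on port 8000.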
Cloud Native Denmark is a premier tech conference where the Kubernetes and Cloud Native community comes together for an experience packed with inspiring talks, hands-on workshops, and great opportunities to build professional networks.
🚀 CND Website: https://cloudnativedenmark.dk/
🚀 CND 2025 Conference Archive: https://2025.cloudnativedenmark.dk/