Deploying AI Runtimes
Author: ecosystem Ai
Uploaded: 2026-02-27
Views: 31
Description:
Eric explains how to deploy the Ecosystem AI runtime into production, with a focus on Kubernetes/OpenShift. He covers three deployment patterns: permanently running runtime pods that accept pushed configuration updates (optionally compiling custom Java pre/post-scoring logic or reward functions); a recommended setup that adds a separate Runtime MCP container to expose MCP protocol tools to agents/LLMs, enable MLflow model integration, and support custom Python APIs; and a less agile approach that uses versioned images with embedded configuration for strict rollback requirements.
He then demos Python notebook automation for dynamic and static model deployments, covering authentication, syncing and compiling code, optional Cassandra and API configuration updates, pushing configuration, and testing calls. The session closes with scaling considerations and a discussion of using managed LLM services such as Amazon Bedrock rather than running LLMs inside Kubernetes.
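The notebook automation described above follows a push-style pattern: authenticate against the running pod, push the new configuration, then issue a test scoring call. The sketch below illustrates that shape only; every URL, endpoint path, payload field, and credential variable here is a hypothetical placeholder, not the actual Ecosystem AI API, which is shown in the video's notebooks.

import os
import requests

# Hypothetical runtime address; the real endpoint depends on your cluster setup.
RUNTIME_URL = os.environ.get("RUNTIME_URL", "http://localhost:8091")

# 1. Authenticate and obtain a bearer token (assumed auth scheme and paths).
auth = requests.post(
    f"{RUNTIME_URL}/auth",
    json={
        "username": os.environ["RUNTIME_USER"],
        "password": os.environ["RUNTIME_PASS"],
    },
)
auth.raise_for_status()
headers = {"Authorization": f"Bearer {auth.json()['token']}"}

# 2. Push an updated deployment configuration to the always-on runtime pod.
config = {"deployment": "my_dynamic_model", "version": "latest"}
requests.put(
    f"{RUNTIME_URL}/config", json=config, headers=headers
).raise_for_status()

# 3. Send a test scoring call to confirm the new configuration is live.
score = requests.post(
    f"{RUNTIME_URL}/invocations",
    json={"customer": "1234", "campaign": "my_dynamic_model"},
    headers=headers,
)
score.raise_for_status()
print(score.json())

Because the pods stay running, steps 2 and 3 can be repeated on every model or configuration change without redeploying images, which is what distinguishes this pattern from the versioned-image approach.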
00:54 Deployment Patterns Overview
02:54 Always-On Runtime Pods
04:57 Custom Java Logic Builds
07:03 Runtime MCP Benefits
10:19 Versioned Image Deployments
13:02 Demo Setup Notebooks
13:15 Dynamic Model Push Workflow
19:17 Static Models with MLflow
21:52 Scaling and LLM Integration Q&A