AI Agents 8 - Evaluation, Cost and Scalability

Автор: Prof. Ghassemi Lectures and Tutorials

Загружено: 2025-11-02

Просмотров: 253550

Описание: In this lecture, Dr. Mohammad Ghassemi explains how to evaluate, optimize, and scale AI agents built with large language models (LLMs). Starting from first principles, he outlines when LLMs should be used, how to select and test models, and how to manage cost-performance tradeoffs. Using the problem of extracting scientific knowledge from 64 million papers since 1996, he demonstrates step-by-step strategies to reduce costs from millions of dollars and centuries of compute to minutes and a few thousand dollars—through parallelization, smaller models, and targeted data retrieval.

Topics include:
Benchmarking LLMs using leaderboards and custom tests
Practical evaluation methods (human, LLM, and metric-based)
Cost modeling and scalability in real systems
Data and tool management via Model Context Protocol (MCP)

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

AI Agents 8 - Evaluation, Cost and Scalability

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

AI Agents 7 - Model Context Protocol

AI Agents 7 - Model Context Protocol

AI Agents 2 - Prompt Engineering.

AI Agents 2 - Prompt Engineering.

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

Web Application Development - Full Stack Application Examples

Web Application Development - Full Stack Application Examples

Владимир Пастухов и Максим Курников | Интервью BILD

Владимир Пастухов и Максим Курников | Интервью BILD

AI Agents 3 - Agentic Design Patterns

AI Agents 3 - Agentic Design Patterns

Лекция от легенды ИИ в Стэнфорде

Лекция от легенды ИИ в Стэнфорде

Как настроить Claude Code за час и получить второй мозг для решения любых своих задач

Как настроить Claude Code за час и получить второй мозг для решения любых своих задач

Дарио Амодеи — «Мы близки к концу экспоненты»

Дарио Амодеи — «Мы близки к концу экспоненты»

ИИ-агенты — кошмар для безопасности? Разбираемся с OpenClaw

ИИ-агенты — кошмар для безопасности? Разбираемся с OpenClaw

Preparing IT for AI Agents: How MCP Shapes the Future of AI

Preparing IT for AI Agents: How MCP Shapes the Future of AI

Karl Friston’s New AI Architecture

Karl Friston’s New AI Architecture

Why AI evals are the hottest new skill for product builders | Hamel Husain & Shreya Shankar

Why AI evals are the hottest new skill for product builders | Hamel Husain & Shreya Shankar

Don't learn AI Agents without Learning these Fundamentals

Don't learn AI Agents without Learning these Fundamentals

AI Agents 1(a) - What are AI Agents, and why do they matter?

AI Agents 1(a) - What are AI Agents, and why do they matter?

Stanford Webinar - Agentic AI: A Progression of Language Model Usage

Stanford Webinar - Agentic AI: A Progression of Language Model Usage

Лучший документальный фильм про создание ИИ

Лучший документальный фильм про создание ИИ

Claude Code создал мне команду AI-агентов (Claude Code + Skills + MCP)

Claude Code создал мне команду AI-агентов (Claude Code + Skills + MCP)

Экспресс-курс RAG для начинающих

Экспресс-курс RAG для начинающих

1: Introduction to Neural Networks and Deep Learning; Training Deep NNs

1: Introduction to Neural Networks and Deep Learning; Training Deep NNs