MIT Lecture, Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control , Oct 2022

Автор: Dimitri Bertsekas

Загружено: 2022-10-26

Просмотров: 3788

Описание: Slides at http://web.mit.edu/dimitrib/www/abstr...
An outline of the main conceptual framework of my new book, which connects the AI/reinforcement learning, and the decision and control methodologies, through the unifying principles of abstract Dynamic Programming and the algorithmic framework of Newton's method. The application of the main ideas to adaptive control and the solution of the Wordle/NY Times puzzle is also discussed. See http://web.mit.edu/dimitrib/www/abstr...

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

MIT Lecture, Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control , Oct 2022

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Lecture 1, 2023: Introduction, AlphaZero, Deterministic DP, course overview, ASU

Lecture 1, 2023: Introduction, AlphaZero, Deterministic DP, course overview, ASU

IFAC TC on Optimal Control: Data-driven Methods in Control

IFAC TC on Optimal Control: Data-driven Methods in Control

Lecture 12 2024; Off-line training with neural nets for approximate VI and PI. Aggregation

Lecture 12 2024; Off-line training with neural nets for approximate VI and PI. Aggregation

Tutorial 1 Machine Learning Perspectives on Model Predictive Control by Byron Boots

Tutorial 1 Machine Learning Perspectives on Model Predictive Control by Byron Boots

Lecture 1, 2024, course overview: RL and DP, AlphaZero, discrete and continuous applications

Lecture 1, 2024, course overview: RL and DP, AlphaZero, discrete and continuous applications

4 Hours Chopin for Studying, Concentration & Relaxation

4 Hours Chopin for Studying, Concentration & Relaxation

Путин хочет закрыть границы. Мобилизация. Трамп и брат-близнец в Москве | Пастухов, Еловский

Путин хочет закрыть границы. Мобилизация. Трамп и брат-близнец в Москве | Пастухов, Еловский

Асторы: Как нищий эмигрант из Европы стал первым олигархом США / Истории миллиардов / МИНАЕВ

Асторы: Как нищий эмигрант из Европы стал первым олигархом США / Истории миллиардов / МИНАЕВ

Lec 01. Introduction to Deep Learning

Lec 01. Introduction to Deep Learning

Что такое адаптивное управление на основе эталонной модели (MRAC)? | Управление на основе обучени...

Что такое адаптивное управление на основе эталонной модели (MRAC)? | Управление на основе обучени...

МОРОЗОВ: "Все идет к этому, а это будет страшным". Почему у Кремля больше не осталось тормозов

ФСБ отключит связь. Статус S09E24

ФСБ отключит связь. Статус S09E24

Lecture 12, 2025; Training of cost functions, approximation in policy space, policy gradient methods

Lecture 12, 2025; Training of cost functions, approximation in policy space, policy gradient methods

Quantum Computing Day: Introduction to Quantum Computing

Quantum Computing Day: Introduction to Quantum Computing

У программистов осталось 18 месяцев, Нейросеть удалила код AWS, Унитазы спасут ИТ | Как Там АйТи #87

У программистов осталось 18 месяцев, Нейросеть удалила код AWS, Унитазы спасут ИТ | Как Там АйТи #87

Powrót Macierewicza. PIS walczy z SAFE | Opolska, Jędrzejek, Ćwiklak | PYTANIE TYGODNIA

Powrót Macierewicza. PIS walczy z SAFE | Opolska, Jędrzejek, Ćwiklak | PYTANIE TYGODNIA

Музыка для работы за компьютером | Фоновая музыка для концентрации и продуктивности

Музыка для работы за компьютером | Фоновая музыка для концентрации и продуктивности

Lecture 4, 2025, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

Lecture 4, 2025, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

MIT 6.S191: Convolutional Neural Networks

MIT 6.S191: Convolutional Neural Networks

Abstract Dynamic Programming, Reinforcement Learning, Newton's Method, and Gradient Optimization

Abstract Dynamic Programming, Reinforcement Learning, Newton's Method, and Gradient Optimization