Davide Paglieri - Adversarial examples to Multi-Agent RL with Quality Diversity

Автор: RL and Agents Reading Group

Загружено: 2024-08-16

Просмотров: 183

Описание: UoE RL Reading Group | 11 July 2024

Speaker: Davide Paglieri (UCL DARK)

Title: Adversarial examples to Multi-Agent RL with Quality Diversity

Abstract: In the rapidly advancing field of multi-agent systems, ensuring robustness in unfamiliar and adversarial settings is crucial. Notwithstanding their outstanding performance in familiar environments, these systems often falter in new situations due to overfitting during the training phase. This is especially pronounced in settings where both cooperative and competitive behaviours are present, encapsulating a dual nature of overfitting and generalisation challenges. To address this issue, we present Multi-Agent Diagnostics for Robustness via Illuminated Diversity (MADRID), a novel approach for generating diverse adversarial scenarios that expose strategic vulnerabilities in pre-trained multi-agent policies. Leveraging the concepts from open-ended learning, MADRID navigates the vast space of adversarial settings, employing a target policy's regret to gauge the vulnerabilities of these settings. We evaluate the effectiveness of MADRID on the 11vs11 version of Google Research Football, one of the most complex environments for multi-agent reinforcement learning. Specifically, we employ MADRID for generating a diverse array of adversarial settings for TiZero, the state-of-the-art approach which "masters" the game through 45 days of training on a large-scale distributed infrastructure. We expose key shortcomings in TiZero's tactical decision-making, underlining the crucial importance of rigorous evaluation in multi-agent systems.

Link: https://arxiv.org/abs/2401.13460

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Davide Paglieri - Adversarial examples to Multi-Agent RL with Quality Diversity

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Tutorial on Quality-Diversity Optimisation

Tutorial on Quality-Diversity Optimisation

Matthew Jackson and Jarek Liesen (Oxford) - A Clean Slate for Offline RL

Matthew Jackson and Jarek Liesen (Oxford) - A Clean Slate for Offline RL

11/6: Data Series: Data Collaboration and Governance

11/6: Data Series: Data Collaboration and Governance

Adversarial Examples and Human-ML Alignment

Adversarial Examples and Human-ML Alignment

Lecture 16 | Adversarial Examples and Adversarial Training

Lecture 16 | Adversarial Examples and Adversarial Training

NotebookLM на максималках. Как изучать всё быстрее чем 99% пользователей

NotebookLM на максималках. Как изучать всё быстрее чем 99% пользователей

Overview of Adversarial Machine Learning

Overview of Adversarial Machine Learning

Дороничев: ИИ — пузырь, который скоро ЛОПНЕТ. Какие перемены ждут мир?

Дороничев: ИИ — пузырь, который скоро ЛОПНЕТ. Какие перемены ждут мир?

Как Windows работает с ОЗУ или почему вам НЕ НУЖНЫ гигабайты памяти

Как Windows работает с ОЗУ или почему вам НЕ НУЖНЫ гигабайты памяти

Как Гений Математик разгадал тайну вселенной

Как Гений Математик разгадал тайну вселенной

Жириновский: остатки Ирана и Турции войдут в состав России! Воскресный вечер с Соловьевым. 13.05.18

Жириновский: остатки Ирана и Турции войдут в состав России! Воскресный вечер с Соловьевым. 13.05.18

Лучший документальный фильм про создание ИИ

Лучший документальный фильм про создание ИИ

МФТИ: Кто создает будущее дронов?

МФТИ: Кто создает будущее дронов?

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

OpenClaw - полный разбор: Tools, Skills, Agents, Sub-agents

OpenClaw - полный разбор: Tools, Skills, Agents, Sub-agents

Фильм Алексея Семихатова «ГРАВИТАЦИЯ»

Фильм Алексея Семихатова «ГРАВИТАЦИЯ»

Лекция от легенды ИИ в Стэнфорде

Лекция от легенды ИИ в Стэнфорде

Джон Кирьяку: Удар ЦРУ в ответном нападении Ирана — разбор войны

Джон Кирьяку: Удар ЦРУ в ответном нападении Ирана — разбор войны

КАК УСТРОЕН TCP/IP?

КАК УСТРОЕН TCP/IP?

Владимир Жириновский дал прогноз по ситуации с Ираном

Владимир Жириновский дал прогноз по ситуации с Ираном