Claude Formanek - Dispelling the Mirage of Progress in Offline MARL through Standardise Baselines...

Автор: RL and Agents Reading Group

Загружено: 2025-01-23

Просмотров: 98

Описание: UoE RL Reading Group | 23 January 2025

Speaker: Claude Formanek (University of Cape Town & InstaDeep)

Title: Dispelling the Mirage of Progress in Offline MARL through Standardise Baselines, Datasets, and Evaluation

Abstract: In this talk, I present two lines of work that seek to elevate the rigour and impact of offline MARL research. First, I reveal that simple, carefully implemented baselines can often surpass or match complex state-of-the-art methods, underscoring the need for consistent evaluation protocols. Second, I show how dataset generation and usage are frequently neglected, making fair comparisons difficult. To address this, I propose a standardized repository of over 80 datasets—complete with consistent formatting, an easy-to-use API, and robust analysis tools. By adopting these best practices, we can foster greater reproducibility and reliability in offline MARL.

Link(s): https://arxiv.org/abs/2406.09068

Bio: Claude is a PhD candidate at the University of Cape Town, specializing in Offline Multi-Agent Reinforcement Learning. He is also a Research Engineer at InstaDeep, focusing on Reinforcement Learning for Industrial Optimization.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Claude Formanek - Dispelling the Mirage of Progress in Offline MARL through Standardise Baselines...

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Lukas Schäfer - Ensemble Value Functions for Efficient Exploration in Multi-Agent RL

Lukas Schäfer - Ensemble Value Functions for Efficient Exploration in Multi-Agent RL

Алексей Венедиктов*. Без посредников / 11.03.26

Алексей Венедиктов*. Без посредников / 11.03.26

Astronomy and astrophysics in Africa | The Royal Society

Astronomy and astrophysics in Africa | The Royal Society

Cam Allen - The Agent Must Choose the Problem Model

Cam Allen - The Agent Must Choose the Problem Model

🎙 Честное слово с Владимиром Миловым

🎙 Честное слово с Владимиром Миловым

Лучший документальный фильм про создание ИИ

Лучший документальный фильм про создание ИИ

Adam White - Empirical Design in Reinforcement Learning

Adam White - Empirical Design in Reinforcement Learning

Гипотеза Пуанкаре — Алексей Савватеев на ПостНауке

Гипотеза Пуанкаре — Алексей Савватеев на ПостНауке

Pablo Samuel Castro - Mixtures of Experts Unlock Parameter Scaling for Deep RL

Pablo Samuel Castro - Mixtures of Experts Unlock Parameter Scaling for Deep RL

Eduardo Pignatelli - On the temporal credit assignment in Deep RL

Eduardo Pignatelli - On the temporal credit assignment in Deep RL

Materials Science | NMC 113/123 | Chapter 4: Imperfections in Solids by 123tutors

Materials Science | NMC 113/123 | Chapter 4: Imperfections in Solids by 123tutors

Лекция от легенды ИИ в Стэнфорде

Лекция от легенды ИИ в Стэнфорде

Tristan Tomilin - Benchmarking Pixel-Based RL in Egocentric Perception Environments

Tristan Tomilin - Benchmarking Pixel-Based RL in Egocentric Perception Environments

Joe Marino (Google DeepMind) - SIMA 2: A Generalist Embodied Agent for Virtual Worlds

Joe Marino (Google DeepMind) - SIMA 2: A Generalist Embodied Agent for Virtual Worlds

Maths in the City: Future Cities

Maths in the City: Future Cities

Yifan Zhong & Jiarong Liu - Maximum Entropy Heterogeneous-Agent Reinforcement Learning

Yifan Zhong & Jiarong Liu - Maximum Entropy Heterogeneous-Agent Reinforcement Learning

Joe Marino - Modern Video Games as a Testbed for Developing Generalist AI Agents

Joe Marino - Modern Video Games as a Testbed for Developing Generalist AI Agents

Полный гайд по Claude: как выжать максимум из этой нейросети

Полный гайд по Claude: как выжать максимум из этой нейросети

David Abel - A Definition of Continual Reinforcement Learning

David Abel - A Definition of Continual Reinforcement Learning

Matthew Jackson and Jarek Liesen (Oxford) - A Clean Slate for Offline RL

Matthew Jackson and Jarek Liesen (Oxford) - A Clean Slate for Offline RL