Tristan Tomilin - Benchmarking Pixel-Based RL in Egocentric Perception Environments

Автор: RL and Agents Reading Group

Загружено: 2024-10-27

Просмотров: 148

Описание: UoE RL Reading Group | 17 October 2024

Speaker: Tristan Tomilin (Technical University of Eindhoven)

Title: Benchmarking Pixel-Based RL in Egocentric Perception Environments.

Abstract: In the pursuit of advancing autonomous systems through Reinforcement Learning (RL), benchmarking is essential for assessing performance, validating the ability to interpret and navigate complex environments, and providing a standardized framework for comparison and improvement. However, existing benchmarks for pixel-based learning within embodied environments often face challenges, including high computational demands, a complex setup, insufficient documentation, an absence of standardized metrics, and a lack of baseline evaluations. In this talk, I will introduce new benchmarks targeting these deficiencies, specifically designed for generalization, continual learning, and safe RL.

Link(s): https://ieeexplore.ieee.org/document/...

Bio: Tristan Tomilin is a PhD student in the Data Mining group in the Department of Mathematics & Computer Science at the Technical University of Eindhoven since 2021. With interests spanning across several domains of Reinforcement Learning, including continual learning, multi-agent systems, safe RL, and generalization, his research is characterized by a focus on developing more robust, reliable, and meaningful simulation environments.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Tristan Tomilin - Benchmarking Pixel-Based RL in Egocentric Perception Environments

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Riccardo Zamboni - Pure Exploration in POMDP: limits and possible solutions

Riccardo Zamboni - Pure Exploration in POMDP: limits and possible solutions

Adam White - Empirical Design in Reinforcement Learning

Adam White - Empirical Design in Reinforcement Learning

Samuel Garcin & Trevor McInroe - Studying the Interplay Between Actor / Critic Representations in RL

Samuel Garcin & Trevor McInroe - Studying the Interplay Between Actor / Critic Representations in RL

Объяснение геометрии скрытого пространства | Геометрическое расширение в ИИ

Объяснение геометрии скрытого пространства | Геометрическое расширение в ИИ

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

LLM и GPT - как работают большие языковые модели? Визуальное введение в трансформеры

Лучший документальный фильм про создание ИИ

Лучший документальный фильм про создание ИИ

КАК УСТРОЕН TCP/IP?

КАК УСТРОЕН TCP/IP?

«Китайский Нострадамус» профессор Сюэциня сделал мрачное предсказание: чем закончится война в Иране

«Китайский Нострадамус» профессор Сюэциня сделал мрачное предсказание: чем закончится война в Иране

Cam Allen - The Agent Must Choose the Problem Model

Cam Allen - The Agent Must Choose the Problem Model

Qwen 3.5 Plus УНИЧТОЖАЕТ платные AI! Бесплатно + уровень Claude Opus

Qwen 3.5 Plus УНИЧТОЖАЕТ платные AI! Бесплатно + уровень Claude Opus

Лекция от легенды ИИ в Стэнфорде

Лекция от легенды ИИ в Стэнфорде

Вся IT-база в ОДНОМ видео: Память, Процессор, Код

Вся IT-база в ОДНОМ видео: Память, Процессор, Код

Как Сделать Настольный ЭЛЕКТРОЭРОЗИОННЫЙ Станок?

Как Сделать Настольный ЭЛЕКТРОЭРОЗИОННЫЙ Станок?

From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

From Deep Reinforcement Learning to LLM-based Agents: Perspectives on Current Research

Я сэкономил 1460 часов на обучении (NotebookLM + Gemini + Obsidian)

Я сэкономил 1460 часов на обучении (NotebookLM + Gemini + Obsidian)

Электричество НЕ течёт по проводам — тревожное открытие Ричарда Фейнмана

Электричество НЕ течёт по проводам — тревожное открытие Ричарда Фейнмана

Безопасность AI или контроль? Что происходит внутри крупнейших AI-компаний

Безопасность AI или контроль? Что происходит внутри крупнейших AI-компаний

AI агенты в 2026: всё что работает прямо сейчас (Claude Code, n8n, RAG, OpenClaw, Agent Teams)

AI агенты в 2026: всё что работает прямо сейчас (Claude Code, n8n, RAG, OpenClaw, Agent Teams)

Lukas Schäfer - Ensemble Value Functions for Efficient Exploration in Multi-Agent RL

Lukas Schäfer - Ensemble Value Functions for Efficient Exploration in Multi-Agent RL

Архитектура интернета и веба | Теоретический курс 2026

Архитектура интернета и веба | Теоретический курс 2026