Introduction to Reinforcement Learning - Shane M. Conway

Author: AI Council

Uploaded: 2014-11-18

Views: 14714

Description: See the full post here: https://www.hakkalabs.co/articles/int...

Machine learning is often divided into three categories: supervised, unsupervised, and reinforcement learning. Reinforcement learning concerns problems that involve sequences of decisions (where each decision affects subsequent opportunities), uncertain effects, and potentially long-term goals. It has been applied successfully in many fields, especially AI/robotics and operations research, by providing a framework for learning from interactions with an environment and from feedback in the form of rewards and penalties.

Shane Conway, a researcher at Kepos Capital, gives a general overview of reinforcement learning, covering how to solve cases where there is uncertainty in both actions and states, as well as cases where the state space is very large.
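
To make the idea of learning from interaction and reward feedback concrete, here is a minimal sketch of tabular Q-learning on a toy problem. It is not taken from the talk: the 5-state chain environment, the slip probability, the reward values, and the function names (`step`, `q_learning`, `greedy_policy`) are all assumptions chosen for illustration, and Q-learning is just one common algorithm that fits the description above.

```python
import random

# Hypothetical toy environment (an assumption, not from the talk):
# a chain of states 0..4 where state 4 is the terminal goal.
N_STATES = 5
ACTIONS = (0, 1)   # 0 = move left, 1 = move right
SLIP = 0.1         # chance the chosen action is reversed (uncertain effects)


def step(state, action, rng):
    """Simulate one interaction; return (next_state, reward, done)."""
    if rng.random() < SLIP:
        action = 1 - action
    move = 1 if action == 1 else -1
    next_state = min(max(state + move, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    # Small penalty per step, reward of 1.0 on reaching the goal.
    reward = 1.0 if done else -0.01
    return next_state, reward, done


def q_learning(episodes=500, alpha=0.1, gamma=0.95, epsilon=0.1, seed=0):
    """Learn a table of action values purely from sampled interactions."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: mostly exploit, occasionally explore.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            next_state, reward, done = step(state, action, rng)
            # Bootstrapped target: reward now plus discounted future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best learned action for each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    print(greedy_policy(q_learning()))
```

The agent is never told the transition rules; it only observes states and rewards, which is the setting the description refers to. A table like `q` stops being practical once the state space is very large, which is where function approximation is typically used instead.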

ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups.

FOLLOW DATA COUNCIL:
Twitter: /datacouncilai
LinkedIn: /datacouncil-ai
Facebook: /datacouncilai
Eventbrite: https://www.eventbrite.com/o/data-cou...
