Soft Actor Critic is Easy in PyTorch | Complete Deep Reinforcement Learning Tutorial

Author: Machine Learning with Phil

Uploaded: 2020-08-19

Views: 43500

Description: The soft actor critic (SAC) algorithm is an off-policy actor-critic method for reinforcement learning problems with continuous action spaces. It uses a maximum-entropy framework: the agent tries to maximize the entropy of its policy along with the reward it collects. We're going to write our very own SAC agent in PyTorch, starting from scratch.
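For reference, the maximum-entropy objective is usually written as below. This is a sketch in the standard notation from the SAC literature, not something spelled out in the description itself: the temperature \alpha, the reward r, and the state-action distribution \rho_\pi are assumed symbols.

```latex
J(\pi) = \sum_{t=0}^{T} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}
  \left[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \right],
\qquad
\mathcal{H}\big(\pi(\cdot \mid s_t)\big)
  = -\mathbb{E}_{a \sim \pi(\cdot \mid s_t)} \left[ \log \pi(a \mid s_t) \right]
```

With \alpha = 0 this reduces to the ordinary expected-return objective; larger \alpha rewards more random (higher-entropy) policies.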

We're going to need to implement several classes for this project:

A replay buffer to keep track of the states the agent encountered, the actions it took, and the rewards it received along the way.

A critic network that tells the agent how valuable it thinks the chosen actions were.

A value network that informs the agent how valuable each state is.
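As a rough illustration of the first of these classes, here is a minimal replay buffer sketch built on NumPy arrays. The class name, method names, and the extra `next_states` and `dones` fields are assumptions made for this example (bootstrapped targets normally need them); the code in the linked repository may be organized differently.

```python
import numpy as np


class ReplayBuffer:
    """Fixed-size circular buffer of transitions (hypothetical layout)."""

    def __init__(self, max_size, state_dim, action_dim):
        self.max_size = max_size
        self.counter = 0  # total number of transitions stored so far
        self.states = np.zeros((max_size, state_dim), dtype=np.float32)
        self.actions = np.zeros((max_size, action_dim), dtype=np.float32)
        self.rewards = np.zeros(max_size, dtype=np.float32)
        # Assumed extras: next state and terminal flag for each transition.
        self.next_states = np.zeros((max_size, state_dim), dtype=np.float32)
        self.dones = np.zeros(max_size, dtype=bool)

    def __len__(self):
        # Number of valid entries currently held.
        return min(self.counter, self.max_size)

    def store(self, state, action, reward, next_state, done):
        # Once the buffer is full, wrap around and overwrite the oldest entry.
        idx = self.counter % self.max_size
        self.states[idx] = state
        self.actions[idx] = action
        self.rewards[idx] = reward
        self.next_states[idx] = next_state
        self.dones[idx] = done
        self.counter += 1

    def sample(self, batch_size, rng=None):
        # Uniformly sample distinct stored transitions.
        # Requires batch_size <= len(self).
        if rng is None:
            rng = np.random.default_rng()
        idx = rng.choice(len(self), size=batch_size, replace=False)
        return (self.states[idx], self.actions[idx], self.rewards[idx],
                self.next_states[idx], self.dones[idx])
```

Preallocated arrays keep sampling cheap, and sampling uniformly from old and new experience is what makes the method off-policy in practice.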

We will also make use of ideas from double Q learning, like taking the minimum of the estimates from two critics, in our update rules for the value and actor networks.
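The minimum-of-two-critics idea can be sketched in PyTorch roughly as follows. This is an illustrative version under stated assumptions, not necessarily the exact update used in the video: the function name is hypothetical, the entropy temperature is fixed at 1, and the actions scored by the critics are assumed to be freshly sampled from the current policy with the reparameterization trick so that gradients can reach the actor.

```python
import torch
import torch.nn.functional as F


def value_and_actor_losses(value, q1, q2, log_probs):
    """Losses for the value and actor networks (hypothetical helper).

    value:     V(s) from the value network, shape (batch,)
    q1, q2:    the two critics' estimates for newly sampled actions, shape (batch,)
    log_probs: log pi(a|s) for those same actions, shape (batch,)
    """
    # Element-wise minimum of the two critics curbs overestimation bias.
    q_min = torch.min(q1, q2)

    # Soft value target: Q minus log-probability (temperature assumed to be 1).
    # detach() stops gradients from the value loss reaching critics or actor.
    value_target = (q_min - log_probs).detach()
    value_loss = 0.5 * F.mse_loss(value, value_target)

    # The actor is pushed toward actions with high Q and high entropy.
    actor_loss = (log_probs - q_min).mean()
    return value_loss, actor_loss


# Tiny usage example with made-up numbers.
value = torch.tensor([1.0, 2.0])
q1 = torch.tensor([3.0, 1.0])
q2 = torch.tensor([2.0, 4.0])
log_probs = torch.tensor([-1.0, 0.5])
v_loss, a_loss = value_and_actor_losses(value, q1, q2, log_probs)
```

In a full agent each loss would be backpropagated through its own optimizer; the sketch only shows how the two critic estimates are combined.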

We will test our agent in the Inverted Pendulum environment from the PyBullet package, an open-source 3D rendering and physics engine.

Code for this video is here:

https://github.com/philtabor/Youtube-...

Learn how to turn deep reinforcement learning papers into code:

Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.

Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to [email protected]

https://www.neuralnet.ai/courses

Or, pick up my Udemy courses here:

Deep Q Learning:
https://www.udemy.com/course/deep-q-l...

Actor Critic Methods:
https://www.udemy.com/course/actor-cr...

Curiosity Driven Deep Reinforcement Learning:
https://www.udemy.com/course/curiosit...

Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-...

Reinforcement Learning Fundamentals:
https://www.manning.com/livevideo/rei...

Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql

Come hang out on Discord here:
/ discord

Need personalized tutoring? Help on a programming project? Shoot me an email! [email protected]

Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: / mlwithphil

#SoftActorCritic #DeepReinforcementLearning #Pytorch

Related videos

Dueling Double Deep Q Learning is Simple with Tensorflow 2

Simply Explaining Deep Q-Learning/Deep Q-Network (DQN) | Python Pytorch Deep Reinforcement Learning

Proximal Policy Optimization (PPO) is Easy With PyTorch | Full PPO Tutorial

Multicore Deep Reinforcement Learning | Asynchronous Advantage Actor Critic (A3C) Tutorial (PYTORCH)

Everything You Need To Master Actor Critic Methods | Tensorflow 2 Tutorial

Can a Random Reinforcement Learning Agent Maximize its Score? Soft Actor Critic (SAC) in Tensorflow2

Soft Actor Critic (V2)

Deep Q-Network & Dueling network architectures for deep reinforcement learning

Deep Reinforcement Learning Tutorials - All Videos

DQN in 100 lines of PyTorch code

L5 DDPG and SAC (Foundations of Deep RL Series)

Overview of Deep Reinforcement Learning Methods

Artificial Intelligence Landed an AI on the Moon! | Deep Q-Learning | PyTorch | Reinforcement Learn...

Reinforcement Learning in Continuous Action Spaces | DDPG Tutorial (Pytorch)

Proximal Policy Optimization Explained

Actor Critic Algorithms

Soft Actor-Critic: a beginner-friendly introduction

The FASTEST introduction to Reinforcement Learning on the internet

Part 1 of 3 - Proximal Policy Optimization Implementation: 11 Core Implementation Details

Dueling Double Deep Q Learning is Easy in PyTorch
