How Large of A Replay Buffer Do You Need? A Deeper Look at Experience Replay | Paper Analysis & Code


The size of the experience replay buffer is usually taken for granted. In this paper, Shangtong Zhang and Richard Sutton look at how the size of the replay buffer affects the performance of deep Q learning. Better yet, they propose a simple variant of replay memory called "combined experience replay" (CER), which adds the latest transition to every sampled batch. Can we replicate their results? Let's try in this video.
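To give a feel for the idea before watching, here is a minimal sketch of combined experience replay in plain Python. It is not the code from the video or the paper: the class name `CombinedReplayBuffer`, the `store` and `sample` methods, and the tuple layout of a transition are all assumptions made for illustration. The only part taken from the paper is the core trick of always putting the newest transition into the batch; drawing the rest uniformly without replacement from the older transitions is a choice made here.

```python
import random
from collections import deque


class CombinedReplayBuffer:
    """Hypothetical uniform replay buffer with combined experience replay.

    Every sampled batch contains the most recently stored transition;
    the remaining slots are filled uniformly from the older transitions.
    """

    def __init__(self, capacity, seed=None):
        # deque with maxlen evicts the oldest transition once capacity is hit
        self.memory = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def store(self, transition):
        # transition is assumed to be (state, action, reward, next_state, done)
        self.memory.append(transition)

    def sample(self, batch_size):
        if batch_size < 1 or batch_size > len(self.memory):
            raise ValueError("batch_size must be between 1 and the buffer length")
        latest = self.memory[-1]
        older = list(self.memory)[:-1]
        # batch_size - 1 older transitions, drawn without replacement
        batch = self.rng.sample(older, batch_size - 1)
        # the newest transition is always part of the batch
        batch.append(latest)
        return batch
```

In a deep Q learning loop, you would call `store` after every environment step and `sample` before every learning step; the buffer size experiment then comes down to varying `capacity` and comparing this class against a buffer that samples all `batch_size` transitions uniformly.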

The paper I'm talking about is here:
https://arxiv.org/abs/1712.01275

The code for this is here:
https://github.com/philtabor/Youtube-...

Learn how to turn deep reinforcement learning papers into code:

Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.


Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to [email protected]

https://www.neuralnet.ai/courses

Or, pick up my Udemy courses here:

Deep Q Learning:
https://www.udemy.com/course/deep-q-l...

Actor Critic Methods:
https://www.udemy.com/course/actor-cr...

Curiosity Driven Deep Reinforcement Learning:
https://www.udemy.com/course/curiosit...

Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-...


Reinforcement Learning Fundamentals:
https://www.manning.com/livevideo/rei...

Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql

Come hang out on Discord here:
  / discord  

Need personalized tutoring? Help on a programming project? Shoot me an email! [email protected]

Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter:   / mlwithphil  
