DeepMind x UCL RL Lecture Series - MDPs and Dynamic Programming [3/13]

Описание к видео DeepMind x UCL RL Lecture Series - MDPs and Dynamic Programming [3/13]

Research Scientist Diana Borsa explains how to solve MDPs with dynamic programming to extract accurate predictions and good control policies.

Slides: https://dpmd.ai/MDPs
Full video lecture series: https://dpmd.ai/DeepMindxUCL21

Комментарии

Информация по комментариям в разработке