COMPSCI 188 - 2018-09-25 - Reinforcement Learning Part 1/2

Описание к видео COMPSCI 188 - 2018-09-25 - Reinforcement Learning Part 1/2

COMPSCI 188, LEC 001 - Fall 2018
COMPSCI 188, LEC 001 - Pieter Abbeel, Daniel Klein
Copyright @2018 UC Regents; all rights reserved

"Slides (from 2018): https://inst.eecs.berkeley.edu/~cs188...
Latest website: https://inst.eecs.berkeley.edu/~cs188
More resources: http://ai.berkeley.edu

00:00 Setup [no content]
02:03 Announcements [outdated]
05:13 RL Introduction
07:15 RL Applications
15:26 RL Definition
18:40 Model-Based Learning
28:15 Model-Based vs. Model-Free Estimation
34:18 Passive RL
35:51 Direct Evaluation
41:40 Sample-Based Policy Evaluation?
45:47 Temporal Difference Learning
50:26 TD Learning: Example
53:33 Break [no content]
55:55 Problems with TD Learning
1:00:32 Active RL
1:02:01 Q-Value Iteration
1:05:50 Q-Learning
1:16:43 Q-Learning: Crawler Bot Demo
1:18:38 Q-Learning Properties
1:19:53 End [no content]"

Комментарии

Информация по комментариям в разработке