MIT: Machine Learning 6.036, Lecture 9: State machines and Markov decision processes (Fall 2020)

Описание к видео MIT: Machine Learning 6.036, Lecture 9: State machines and Markov decision processes (Fall 2020)

* Lecture 9 for the MIT course 6.036: Introduction to Machine Learning (Fall 2020 Semester)
* Full lecture information and slides: http://tamarabroderick.com/ml.html
* Lecture date: 2020 / 10 / 27
* Lecturer: Tamara Broderick
* Lecture TAs: Crystal Wang and Satvat Jagwani

If you find any ways to improve how well the video captions reflect the live lectures, please submit a pull request to: https://github.com/tbroderick/ml_6036...

Errata: At the time of recording, slide 8 had an incorrect V^3_{\pi_B}(poor) value. Thanks to MIT student Bhav Jain for catching the error! The value (108) should now be correct in the video and also in the slides pdf linked above.

0:00:00 Overview & Plan
0:02:31 Intro to extended farming example
0:04:16 State machine
0:13:07 Markov decision process
0:34:54 What's the value of a policy?
0:57:54 What's the best policy?
1:12:39 What if I don't stop farming?

Комментарии

Информация по комментариям в разработке