23. Accelerating Gradient Descent (Use Momentum)

Описание к видео 23. Accelerating Gradient Descent (Use Momentum)

MIT 18.065 Matrix Methods in Data Analysis, Signal Processing, and Machine Learning, Spring 2018
Instructor: Gilbert Strang
View the complete course: https://ocw.mit.edu/18-065S18
YouTube Playlist:    • MIT 18.065 Matrix Methods in Data Ana...  

In this lecture, Professor Strang explains both momentum-based gradient descent and Nesterov's accelerated gradient descent.

License: Creative Commons BY-NC-SA
More information at https://ocw.mit.edu/terms
More courses at https://ocw.mit.edu

Комментарии

Информация по комментариям в разработке