MIT 18.065 Matrix Methods in Data Analysis, Signal Processing, and Machine Learning, Spring 2018
Instructor: Suvrit Sra
View the complete course: https://ocw.mit.edu/18065S18
YouTube Playlist: • MIT 18.065 Matrix Methods in Data Ana...
Professor Suvrit Sra gives this guest lecture on stochastic gradient descent (SGD), which randomly selects a minibatch of data at each step. SGD is still the primary method for training large-scale machine learning systems.
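As a rough illustration of the idea in the description (not code from the lecture), the sketch below runs minibatch SGD on a small least-squares line fit. The function name `minibatch_sgd`, the synthetic data, and the step size, batch size, and step count are all assumptions chosen for this example.

```python
import random

def minibatch_sgd(xs, ys, batch_size=4, lr=0.05, steps=2000, seed=0):
    """Fit y ~ w*x + b by minibatch SGD on half the mean squared error."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Randomly select a minibatch of distinct indices for this step.
        batch = rng.sample(range(n), batch_size)
        # Gradient estimate computed from the minibatch only,
        # not from the full data set.
        gw = sum((w * xs[i] + b - ys[i]) * xs[i] for i in batch) / batch_size
        gb = sum((w * xs[i] + b - ys[i]) for i in batch) / batch_size
        # Step against the stochastic gradient.
        w -= lr * gw
        b -= lr * gb
    return w, b

# Hypothetical noiseless data drawn from y = 3x + 1.
xs = [i / 10 for i in range(-20, 21)]
ys = [3 * x + 1 for x in xs]
w, b = minibatch_sgd(xs, ys)
print(w, b)
```

Each step touches only `batch_size` data points, so its cost does not grow with the size of the data set. That is the property that makes the method attractive at large scale.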
License: Creative Commons BY-NC-SA
More information at https://ocw.mit.edu/terms
More courses at https://ocw.mit.edu