6.7960 Deep Learning
6.7960 - Deep Learning (Fall 2024, MIT OCW). Instructors: Prof. Phillip Isola, Prof. Sara Beery, and Dr. Jeremy Bernstein. This course covers the fundamentals of deep learning, including both theory and applications. Topics include neural net architectures (MLPs, CNNs, RNNs, graph nets, transformers), geometry and invariances in deep learning, backpropagation and automatic differentiation, learning theory and generalization in high dimensions, and applications to computer vision, natural language processing, and robotics. (from ocw.mit.edu)
Lecture 20 - Scaling Laws
This video covers scaling laws for neural architectures: power laws, their limitations, their theoretical foundations, and the concept of critical batch size.