6.7960 Deep Learning
6.7960 - Deep Learning (Fall 2024, MIT OCW). Instructors: Prof. Phillip Isola, Prof. Sara Beery, and Dr. Jeremy Bernstein. This course covers the fundamentals of deep learning, including both theory and applications. Topics include neural net architectures (MLPs, CNNs, RNNs, graph nets, transformers), geometry and invariances in deep learning, backpropagation and automatic differentiation, learning theory and generalization in high dimensions, and applications to computer vision, natural language processing, and robotics. (from ocw.mit.edu)
| Lecture 08 - Architectures: Transformers |
This video introduces transformers, focusing on three key ideas: tokens, attention, and positional codes. It also explores how transformers relate to MLPs, GNNs, and CNNs as variations on common principles.
Go to the Course Home or watch other lectures: