Lecture Schedule
Below is the tentative schedule for the course. Note that dates and topics may change as the semester progresses.
The schedule can be found in Excel format here for paper presentation and review sign-ups.
Lecture | Date | Topic | Materials/Readings | Assignments & Deadlines |
---|---|---|---|---|
1 | 8/26 (T) | Introduction (Slides) |
|
|
2 | 8/28 (Th) | Abstract NN & gradient computation (Slides) |
|
|
3 | 9/2 (T) | DNNs, CNNs, RNNs, practical issues (Slides) |
|
|
4 | 9/4 (Th) | Attention, Transformers, LLMs (Slides) |
|
|
5 | 9/9 (T) | Presentations: Optimizing Attention (Slides) |
|
|
6 | 9/11 (Th) | Monte Carlo methods & variance reduction (Slides) |
|
|
7 | 9/16 (T) | Presentations: Optimizing LLMs |
|
|
8 | 9/18 (Th) | Markov Decision Processes (MDPs) (Slides) |
|
|
9 | 9/23 (T) | Sampling (TD(0), TD(n), Q-learning) (Slides) |
|
|
10 | 9/25 (Th) | Sampling II (MC) (Slides) |
|
|
11 | 9/30 (T) | Presentations: Deep Q-Networks (DQN) (Slides), Hindsight Experience Replay (Slides) |
|
|
12 | 10/2 (Th) | Policy gradients (I): REINFORCE (Slides) |
|
|
13 | 10/7 (T) | Presentations: RL Environments |
|
|
14 | 10/9 (Th) | Policy gradients (II): Baseline methods (Slides) | ||
15 | 10/14 (T) | Presentations: Actor-Critics (Slides) & DDPG (Slides) |
|
|
16 | 10/16 (Th) | Policy gradients (III): Trust-region methods (Slides) |
|
|
17 | 10/21 (T) | Presentations: Policy Gradient Methods |
|
|
18 | 10/23 (Th) | Reinforcement Learning from Human Feedback (RLHF) and Imitation Learning |
|
|
19 | 10/28 (T) | Presentations: RLHF |
|
|
20 | 10/30 (Th) | Presentations: Imitation Learning |
|
|
21 | 11/4 (T) | Evolutionary Computation |
|
|
22 | 11/6 (Th) | Presentations: Applications of Evolutionary Computation |
|
|
23 | 11/11 (T) | Presentations: ML for Systems |
|
|
24 | 11/13 (Th) | Presentations: ML for Systems (II) |
|
|
25 | 11/18 (T) | Presentations: Large-scale distributed RL | ||
26 | 11/20 (Th) | Presentations: Other RL Topics |
|
|
THANKSGIVING BREAK | ||||
27 | 12/2 (T) | Project presentations | ||
28 | 12/4 (Th) | Project presentations |
|