Lecture Schedule
Below is the tentative schedule for the course. Note that dates and topics may change as the semester progresses.
The schedule can be found in Excel format here for paper presentation and review sign-ups.
Lecture | Date | Topic | Materials/Readings | Assignments & Deadlines |
---|---|---|---|---|
1 | 8/26 (T) | Introduction (Slides) |
|
|
2 | 8/28 (Th) | Abstract NN & gradient computation (Slides) |
|
|
3 | 9/2 (T) | DNNs, CNNs, RNNs, practical issues (Slides) |
|
|
4 | 9/4 (Th) | Attention, Transformers, LLMs |
|
|
5 | 9/9 (T) | Presentations: Optimizing Attention |
|
|
6 | 9/11 (Th) | Monte Carlo methods & variance reduction |
|
|
7 | 9/16 (T) | Presentations: DeepSeek |
|
|
8 | 9/18 (Th) | Markov Decision Processes (MDPs) |
|
|
9 | 9/23 (T) | Sampling (TD(0), TD(n), MC, Q-learning, MC) |
|
|
10 | 9/25 (Th) | Presentations: DQN, Double DQN, Experience Replay |
|
|
11 | 9/30 (T) | Policy gradients (I): REINFORCE |
|
|
12 | 10/2 (Th) | Presentations: RL Environments |
|
|
13 | 10/7 (T) | Policy gradients (II): Baseline methods |
|
|
14 | 10/9 (Th) | Presentations: Actor-Critics & DDPG |
|
|
15 | 10/14 (T) | Policy gradients (III): Trust-region methods |
|
|
16 | 10/16 (Th) | Presentations: Policy Optimization Methods |
|
|
17 | 10/21 (T) | Reinforcement Learning from Human Feedback (RLHF) and Imitation Learning |
|
|
18 | 10/23 (Th) | Presentations: RLHF |
|
|
19 | 10/28 (T) | Presentations: Imitation Learning |
|
|
20 | 10/30 (Th) | Evolutionary Computation |
|
|
21 | 11/4 (T) | Presentations: Applications of Evolutionary Computation |
|
|
21 | 11/4 (T) | Imitation Learning | ||
22 | 11/6 (Th) | Presentations: ML for Systems |
|
|
23 | 11/11 (T) | Presentations: ML for Systems (II) |
|
|
24 | 11/13 (Th) | Parallel/Distributed RL |
|
|
25 | 11/18 (T) | Presentations: Large-scale distributed RL | ||
26 | 11/20 (Th) | Presentations: Other RL Topics |
|
|
THANKSGIVING BREAK | ||||
27 | 12/2 (T) | Project presentations | ||
28 | 12/4 (Th) | Project presentations |
|