CS395T - Fall 2025

Lecture Schedule

Below is the tentative schedule for the course. Note that dates and topics may change as the semester progresses.

The schedule can be found in Excel format here for paper presentation and review sign-ups.

Lecture Date Topic Materials/Readings Assignments & Deadlines
1 8/26 (T) Introduction (Slides)
2 8/28 (Th) Abstract NN & gradient computation (Slides)
3 9/2 (T) DNNs, CNNs, RNNs, practical issues (Slides)
4 9/4 (Th) Attention, Transformers, LLMs (Slides)
5 9/9 (T) Presentations: Optimizing Attention (Slides)
6 9/11 (Th) Monte Carlo methods & variance reduction (Slides)
  • Barto & Sutton – Ch 5 Monte Carlo Methods
7 9/16 (T) Presentations: Optimizing LLMs
8 9/18 (Th) Markov Decision Processes (MDPs) (Slides)
9 9/23 (T) Sampling (TD(0), TD(n), Q-learning) (Slides)
  • Barto & Sutton – Ch 6 Temporal Difference Learning
10 9/25 (Th) Sampling II (MC) (Slides)
  • Barto & Sutton – Ch 5 Monte Carlo Methods
11 9/30 (T) Presentations: Deep Q-Networks (DQN), Hindsight Experience Replay
12 10/2 (Th) Policy gradients (I): REINFORCE (Slides)
13 10/7 (T) Presentations: RL Environments
14 10/9 (Th) Policy gradients (II): Baseline methods (Slides)
15 10/14 (T) Presentations: Actor-Critics & DDPG
16 10/16 (Th) Policy gradients (III): Trust-region methods (Slides)
  • Definitive Guide to Policy Gradients §3-4 (Matthias Lehmann, 2024)
  • Trust Region Policy Optimization (TRPO) (John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan, Philipp Moritz, 2015)
  • Proximal Policy Optimization (PPO) (John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov, 2017)
17 10/21 (T) Presentations: Policy Optimization Methods
18 10/23 (Th) Reinforcement Learning from Human Feedback (RLHF) and Imitation Learning (Slides)
19 10/28 (T) Presentations: RLHF
20 10/30 (Th) Presentations: Imitation Learning
21 11/4 (T) Evolutionary Computation
22 11/6 (Th) Presentations: Applications of Evolutionary Computation
23 11/11 (T) Presentations: AI-Driven Research for Systems
24 11/13 (Th) Presentations: ML for Systems (II)
  • Project check-in (meeting required)
25 11/18 (T) Presentations: Large-scale distributed RL
26 11/20 (Th) Presentations: Other RL Topics
THANKSGIVING BREAK
27 12/2 (T) Project presentations
28 12/4 (Th) Project presentations
  • Final project paper due