CS395T - Fall 2025

Lecture Schedule

Below is the tentative schedule for the course. Note that dates and topics may change as the semester progresses.

The schedule can be found in Excel format here for paper presentation and review sign-ups.

Lecture Date Topic Materials/Readings Assignments & Deadlines
1 8/26 (T) Introduction (Slides)
2 8/28 (Th) Abstract NN & gradient computation (Slides)
3 9/2 (T) DNNs, CNNs, RNNs, practical issues (Slides)
4 9/4 (Th) Attention, Transformers, LLMs
5 9/9 (T) Presentations: Optimizing Attention
6 9/11 (Th) Monte Carlo methods & variance reduction
  • Barto & Sutton – Ch 5 Monte Carlo Methods
7 9/16 (T) Presentations: DeepSeek
8 9/18 (Th) Markov Decision Processes (MDPs)
  • Barto & Sutton – Ch 3 MDPs
9 9/23 (T) Sampling (TD(0), TD(n), MC, Q-learning, MC)
  • Barto & Sutton – Ch 6 Temporal Difference Learning
10 9/25 (Th) Presentations: DQN, Double DQN, Experience Replay
11 9/30 (T) Policy gradients (I): REINFORCE
12 10/2 (Th) Presentations: RL Environments
13 10/7 (T) Policy gradients (II): Baseline methods
  • Tutorial TBD
14 10/9 (Th) Presentations: Actor-Critics & DDPG
15 10/14 (T) Policy gradients (III): Trust-region methods
  • Definitive Guide to Policy Gradients §3-4 (Matthias Lehmann, 2024)
  • Trust Region Policy Optimization (TRPO) (John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan, Philipp Moritz, 2015)
  • Proximal Policy Optimization (PPO) (John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov, 2017)
  • Project ideas due (meeting required)
16 10/16 (Th) Presentations: Policy Optimization Methods
  • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
17 10/21 (T) Reinforcement Learning from Human Feedback (RLHF) and Imitation Learning
18 10/23 (Th) Presentations: RLHF
19 10/28 (T) Presentations: Imitation Learning
20 10/30 (Th) Evolutionary Computation
21 11/4 (T) Presentations: Applications of Evolutionary Computation
21 11/4 (T) Imitation Learning
    22 11/6 (Th) Presentations: ML for Systems
    23 11/11 (T) Presentations: ML for Systems (II)
    24 11/13 (Th) Parallel/Distributed RL
    • Tutorial TBD
  • Project check-in (meeting required)
  • 25 11/18 (T) Presentations: Large-scale distributed RL
    26 11/20 (Th) Presentations: Other RL Topics
    THANKSGIVING BREAK
    27 12/2 (T) Project presentations
    28 12/4 (Th) Project presentations
    • Final project paper due