CS395T - Fall 2025

Lecture Schedule

Below is the tentative schedule for the course. Note that dates and topics may change as the semester progresses.

The schedule can be found in Excel format here for paper presentation and review sign-ups.

Lecture Date Topic Materials/Readings Assignments & Deadlines
1 8/26 (T) Introduction (Slides)
2 8/28 (Th) Abstract NN & gradient computation (Slides)
3 9/2 (T) DNNs, CNNs, RNNs, practical issues (Slides)
4 9/4 (Th) Attention, Transformers, LLMs (Slides)
5 9/9 (T) Presentations: Optimizing Attention (Slides)
6 9/11 (Th) Monte Carlo methods & variance reduction (Slides)
  • Barto & Sutton – Ch 5 Monte Carlo Methods
7 9/16 (T) Presentations: Optimizing LLMs
8 9/18 (Th) Markov Decision Processes (MDPs) (Slides)
9 9/23 (T) Sampling (TD(0), TD(n), Q-learning) (Slides)
  • Barto & Sutton – Ch 6 Temporal Difference Learning
10 9/25 (Th) Sampling II (MC) (Slides)
  • Barto & Sutton – Ch 5 Monte Carlo Methods
11 9/30 (T) Presentations: Deep Q-Networks (DQN) (Slides), Hindsight Experience Replay (Slides)
12 10/2 (Th) Policy gradients (I): REINFORCE (Slides)
13 10/7 (T) Presentations: RL Environments
14 10/9 (Th) Policy gradients (II): Baseline methods (Slides)
15 10/14 (T) Presentations: Actor-Critics (Slides) & DDPG (Slides)
16 10/16 (Th) Policy gradients (III): Trust-region methods (Slides)
  • Definitive Guide to Policy Gradients §3-4 (Matthias Lehmann, 2024)
  • Trust Region Policy Optimization (TRPO) (John Schulman, Sergey Levine, Pieter Abbeel, Michael Jordan, Philipp Moritz, 2015)
  • Proximal Policy Optimization (PPO) (John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov, 2017)
17 10/21 (T) Presentations: Policy Gradient Methods
  • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
18 10/23 (Th) Reinforcement Learning from Human Feedback (RLHF) and Imitation Learning
19 10/28 (T) Presentations: RLHF
20 10/30 (Th) Presentations: Imitation Learning
21 11/4 (T) Evolutionary Computation
22 11/6 (Th) Presentations: Applications of Evolutionary Computation
23 11/11 (T) Presentations: ML for Systems
24 11/13 (Th) Presentations: ML for Systems (II)
  • Project check-in (meeting required)
25 11/18 (T) Presentations: Large-scale distributed RL
26 11/20 (Th) Presentations: Other RL Topics
THANKSGIVING BREAK
27 12/2 (T) Project presentations
28 12/4 (Th) Project presentations
  • Final project paper due