Short notes on various topics (use with your own risk).
The Clipping Function in PPO
The Issue of Data Drifting in PPO
Differentiation Under the Integral Sign
Integration by Parts on \(\mathbb R^d\) and Stein's Identity
The Gumbel-Max Trick and Lehmann Family
DiffusionNFT as Taylor Approximation
Density Ratio Estimation