Breaking the curse of horizon: Infinite-horizon off-policy estimation

Publication
Advances in Neural Information Processing Systems