Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation

Publication
International Conference on Learning Representations(ICLR)