Off-Policy Evaluation

Off-Policy Interval Estimation with Lipschitz Value Iteration

Accountable Off-Policy Evaluation With Kernel Bellman Statistics

Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation

Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation