Uncertainty

Off-Policy Interval Estimation with Lipschitz Value Iteration

Accountable Off-Policy Evaluation With Kernel Bellman Statistics