H Namkoong, R Keramati, S Yadlowsky… - Proceedings of the 34th …, 2020 - dl.acm.org
When observed decisions depend only on observed features, off-policy policy evaluation
(OPE) methods for sequential decision problems can estimate the performance of evaluation …