Y Xu, C Shi, S Luo, L Wang, R Song - arXiv e-prints, 2022 - ui.adsabs.harvard.edu
Off-policy evaluation (OPE) is concerned with evaluating a new target policy using offline
data generated by a potentially different behavior policy. It is critical in a number of …