Distributional offline policy evaluation with predictive error guarantees

R Wu, M Uehara, W Sun - International Conference on …, 2023 - proceedings.mlr.press
We study the problem of estimating the distribution of the return of a policy using an offline
dataset that is not generated from the policy, ie, distributional offline policy evaluation (OPE) …

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

H Kiyohara, R Kishimoto, K Kawakami… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper introduces SCOPE-RL, a comprehensive open-source Python software designed
for offline reinforcement learning (offline RL), off-policy evaluation (OPE), and selection …

Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect

O Neopane, A Ramdas, A Singh - arXiv preprint arXiv:2411.14341, 2024 - arxiv.org
Estimation of the Average Treatment Effect (ATE) is a core problem in causal inference with
strong connections to Off-Policy Evaluation in Reinforcement Learning. This paper considers …

Functional linear regression of cumulative distribution functions

Q Zhang, A Makur, K Azizzadenesheli - arXiv preprint arXiv:2205.14545, 2022 - arxiv.org
The estimation of cumulative distribution functions (CDFs) is an important learning task with
a great variety of downstream applications, such as risk assessments in predictions and …

Sparse Contextual CDF Regression

K Azizzadenesheli, W Lu, A Makur, Q Zhang - Transactions on Machine … - openreview.net
Estimating cumulative distribution functions (CDFs) of context-dependent random variables
is a central statistical task underpinning numerous applications in machine learning and …

[HTML][HTML] The Mechanism of Institutional Governance for China's PPP Projects

L Sun, X Hu, W Wang, X Li - Open Journal of Social Sciences, 2024 - scirp.org
Taking the current PPP governance in China as the research object, this paper explores the
governance structure and the role of institutional governance. Grounded theory is used in …