Adaptive treatment assignment in experiments for policy choice

M Kasy, A Sautmann - Econometrica, 2021 - Wiley Online Library
Standard experimental designs are geared toward point estimation and hypothesis testing,
while bandit algorithms are geared toward in‐sample outcomes. Here, we instead consider …

[HTML][HTML] Causal Machine Learning and its use for public policy

M Lechner - Swiss Journal of Economics and Statistics, 2023 - Springer
In recent years, microeconometrics experienced the 'credibility revolution', culminating in the
2021 Nobel prices for David Card, Josh Angrist, and Guido Imbens. This 'revolution'in how to …

[HTML][HTML] Who should get vaccinated? Individualized allocation of vaccines over SIR network

T Kitagawa, G Wang - Journal of Econometrics, 2023 - Elsevier
How to allocate vaccines over heterogeneous individuals is one of the important policy
decisions in pandemic times. This paper develops a procedure to estimate an individualized …

Optimal dynamic treatment regimes and partial welfare ordering

S Han - Journal of the American Statistical Association, 2023 - Taylor & Francis
Dynamic treatment regimes are treatment allocations tailored to heterogeneous individuals
(eg, via previous outcomes and covariates). The optimal dynamic treatment regime is a …

A distribution optimization framework for confidence bounds of risk measures

H Liang, Z Luo - International Conference on Machine …, 2023 - proceedings.mlr.press
We present a distribution optimization framework that significantly improves confidence
bounds for various risk measures compared to previous methods. Our framework …

Dynamic Targeting: Experimental Evidence from Energy Rebate Programs

T Ida, T Ishihara, K Ito, D Kido, T Kitagawa… - 2024 - nber.org
Economic policies often involve dynamic interventions, where individuals receive repeated
interventions over multiple periods. This dynamics makes past responses informative to …

[HTML][HTML] Treatment recommendation with distributional targets

AB Kock, D Preinerstorfer, B Veliyev - Journal of Econometrics, 2023 - Elsevier
We study the problem of a decision maker who must provide the best possible treatment
recommendation based on an experiment. The desirability of the outcome distribution …

Bandit algorithms for policy learning: methods, implementation, and welfare-performance

T Kitagawa, J Rowley - The Japanese Economic Review, 2024 - Springer
Static supervised learning—in which experimental data serves as a training sample for the
estimation of an optimal treatment assignment policy—is a commonly assumed framework of …

Dynamically optimal treatment allocation using reinforcement learning

K Adusumilli, F Geiecke, C Schilter - arXiv preprint arXiv:1904.01047, 2019 - arxiv.org
Dynamic decisions are pivotal to economic policy making. We show how existing evidence
from randomized control trials can be utilized to guide personalized decisions in challenging …

Regularizing Discrimination in Optimal Policy Learning with Distributional Targets

AB Kock, D Preinerstorfer - arXiv preprint arXiv:2401.17909, 2024 - arxiv.org
A decision maker typically (i) incorporates training data to learn about the relative
effectiveness of the treatments, and (ii) chooses an implementation mechanism that implies …