Off-policy evaluation for large action spaces via conjunct effect modeling

Y Saito, Q Ren, T Joachims - international conference on …, 2023 - proceedings.mlr.press
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …

[PDF][PDF] Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Y Saito, Q Ren, T Joachims - stat, 2023 - researchgate.net
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Y Saito, Q Ren, T Joachims - openreview.net
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …

Off-policy evaluation for large action spaces via conjunct effect modeling

Y Saito, Q Ren, T Joachims - … of the 40th International Conference on …, 2023 - dl.acm.org
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …

[PDF][PDF] Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Y Saito, Q Ren, T Joachims - proceedings.mlr.press
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …

[引用][C] Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Y Saito, Q Ren, T Joachims - International Conference on Machine …, 2023 - par.nsf.gov

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Y Saito, Q Ren, T Joachims - arXiv preprint arXiv:2305.08062, 2023 - arxiv.org
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …

Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

Y Saito, Q Ren, T Joachims - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action
spaces where conventional importance-weighting approaches suffer from excessive …