In many applications, eg in healthcare and e-commerce, the goal of a contextual bandit may be to learn an optimal treatment assignment policy at the end of the experiment. That is, to …
We study the problem of experiment planning with function approximation in contextual bandit problems. In settings where there is a significant overhead to deploying adaptive …
Most linear experimental design problems assume homogeneous variance, while the presence of heteroskedastic noise is present in many realistic settings. Let a learner have …
In an attempt to make algorithms fair, the machine learning literature has largely focused on equalizing decisions, outcomes, or error rates across race or gender groups. To illustrate …
In digital marketing, experimenting with new website content is one of the key levers to improve customer engagement. However, creating successful marketing content is a manual …
S Geng, H Nassif… - Uncertainty in Artificial …, 2023 - proceedings.mlr.press
In dynamic discrete choice models, a commonly studied problem is estimating parameters of agent reward functions (also known as' structural'parameters) using agent behavioral data …
Z Li, K Jamieson, L Jain - International Conference on …, 2024 - proceedings.mlr.press
Given a set of arms $\mathcal {Z}\subset\mathbb {R}^ d $ and an unknown parameter vector $\theta_\ast\in\mathbb {R}^ d $, the pure exploration linear bandits problem aims to return …
J Chen, N Sharma, T Khan, S Liu, B Chang… - Proceedings of the …, 2023 - dl.acm.org
Cache management is critical for Content Delivery Networks (CDNs), impacting their performance and operational costs. Most production CDNs apply static, hand-tuned caching …
J Du, S Gao, CH Chen - Manufacturing & Service …, 2024 - pubsonline.informs.org
Problem definition: Personalized medicine (PM) seeks the best treatment for each patient among a set of available treatment methods. Because a specific treatment does not work …