Robustness of Whittle index policy to model approximation

N Akbarzadeh, A Mahajan - IEEE Transactions on Control of …, 2023 - ieeexplore.ieee.org

Reinforcement learning is an attractive approach to learn good resource allocation and
scheduling policies based on data when the system model is unknown. However, the …

被引用次数：13 相关文章所有 9 个版本

[PDF] arxiv.org

Limited resource allocation in a non-Markovian world: the case of maternal and child healthcare

P Danassis, S Verma, JA Killian, A Taneja… - arXiv preprint arXiv …, 2023 - arxiv.org

The success of many healthcare programs depends on participants' adherence. We
consider the problem of scheduling interventions in low resource settings (eg, placing timely …

被引用次数：6 相关文章所有 3 个版本

[PDF] ijcai.org

[PDF][PDF] Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization

Y Zhao, N Behari, E Hughes, E Zhang, D Nagaraj… - 2024 - ijcai.org

Restless multi-arm bandits (RMABs) is a class of resource allocation problems with broad
application in areas such as healthcare, online advertising, and anti-poaching. We explore …

被引用次数：1 相关文章

[PDF] arxiv.org

Towards Zero Shot Learning in Restless Multi-armed Bandits

Y Zhao, N Behari, E Hughes, E Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Restless multi-arm bandits (RMABs), a class of resource allocation problems with broad
application in areas such as healthcare, online advertising, and anti-poaching, have recently …

被引用次数：3 相关文章所有 6 个版本

[PDF] mcgill.ca

[图书][B] Restless Bandits: Indexability, Computation of Whittle Index and Learning

N Akbarzadeh - 2022 - search.proquest.com

Restless bandits are a class of sequential resource allocation problems concerned with
allocating one or more resources among several alternative processes where the evolution …

高级搜索

QQ 群