Avoiding starvation of arms in restless multi-armed bandit

4 results (0.05 seconds)

Learning the optimal control for evolving systems with converging dynamics

Q Liu, Z Fang - Proceedings of the ACM on Measurement and Analysis …, 2024 - dl.acm.org

We consider a principal or controller that can pick actions from a fixed action set to control an
evolving system with converging dynamics. The actions are interpreted as different …

Cited by 1 · Related articles · All 2 versions

Fair Resource Allocation in Weakly Coupled Markov Decision Processes

X Tu, Y Adulyasak, N Akbarzadeh, E Delage - arXiv preprint arXiv …, 2024 - arxiv.org

We consider fair resource allocation in sequential decision-making environments modeled
as weakly coupled Markov decision processes, where resource constraints couple the …

Fairness of Exposure in Online Restless Multi-armed Bandits

A Sood, S Jain, S Gujar - arXiv preprint arXiv:2402.06348, 2024 - arxiv.org

Restless multi-armed bandits (RMABs) generalize the multi-armed bandits where each arm
exhibits Markovian behavior and transitions according to their transition dynamics. Solutions …

Cited by 1 · Related articles · All 4 versions

EduQate: Generating Adaptive Curricula through RMABs in Education Settings

S Tio, D Li, P Varakantham - arXiv preprint arXiv:2406.14122, 2024 - arxiv.org

There has been significant interest in the development of personalized and adaptive
educational tools that cater to a student's individual learning progress. A crucial aspect in …
