Learning the optimal control for evolving systems with converging dynamics

Q Liu, Z Fang - Proceedings of the ACM on Measurement and Analysis …, 2024 - dl.acm.org
We consider a principle or controller that can pick actions from a fixed action set to control an
evolving system with converging dynamics. The actions are interpreted as different …

Fair Resource Allocation in Weakly Coupled Markov Decision Processes

X Tu, Y Adulyasak, N Akbarzadeh, E Delage - arXiv preprint arXiv …, 2024 - arxiv.org
We consider fair resource allocation in sequential decision-making environments modeled
as weakly coupled Markov decision processes, where resource constraints couple the …

Fairness of Exposure in Online Restless Multi-armed Bandits

A Sood, S Jain, S Gujar - arXiv preprint arXiv:2402.06348, 2024 - arxiv.org
Restless multi-armed bandits (RMABs) generalize the multi-armed bandits where each arm
exhibits Markovian behavior and transitions according to their transition dynamics. Solutions …

EduQate: Generating Adaptive Curricula through RMABs in Education Settings

S Tio, D Li, P Varakantham - arXiv preprint arXiv:2406.14122, 2024 - arxiv.org
There has been significant interest in the development of personalized and adaptive
educational tools that cater to a student's individual learning progress. A crucial aspect in …