Markovian restless bandits and index policies: A review

J Niño-Mora - Mathematics, 2023 - mdpi.com
The restless multi-armed bandit problem is a paradigmatic modeling framework for optimal
dynamic priority allocation in stochastic models of wide-ranging applications that has been …

Autonomous tracking using a swarm of UAVs: A constrained multi-agent reinforcement learning approach

YJ Chen, DK Chang, C Zhang - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
In this paper, we aim to design an autonomous tracking system for a swarm of unmanned
aerial vehicles (UAVs) to localize a radio frequency (RF) mobile target. In the system, UAVs …

Scheduling to minimize age of incorrect information with imperfect channel state information

Y Chen, A Ephremides - Entropy, 2021 - mdpi.com
In this paper, we study a slotted-time system where a base station needs to update multiple
users at the same time. Due to the limited resources, only part of the users can be updated in …

A Whittle index policy for the remote estimation of multiple continuous Gauss-Markov processes over parallel channels

TZ Ornee, Y Sun - Proceedings of the Twenty-fourth International …, 2023 - dl.acm.org
In this paper, we study a sampling and transmission scheduling problem for multi-source
remote estimation, where a scheduler determines when to take samples from multiple …

Conditions for indexability of restless bandits and an algorithm to compute Whittle index

N Akbarzadeh, A Mahajan - Advances in Applied Probability, 2022 - cambridge.org
Restless bandits are a class of sequential resource allocation problems concerned with
allocating one or more resources among several alternative processes where the evolution …

Uncertainty-of-information scheduling: A restless multiarmed bandit framework

G Chen, SC Liew, Y Shao - IEEE Transactions on Information …, 2022 - ieeexplore.ieee.org
This paper proposes using the uncertainty of information (UoI), measured by Shannon's
entropy, as a metric for information freshness. We consider a system in which a central …

Multi-channel transmission scheduling with hopping scheme under uncertain channel states

Y Song, D Ye - Journal of the Franklin Institute, 2023 - Elsevier
This paper investigates the multi-channel transmission scheduling problem for remote state
estimation based on a hopping scheme in cyber-physical systems. The smart sensor sends …

On learning Whittle index policy for restless bandits with scalable regret

N Akbarzadeh, A Mahajan - IEEE Transactions on Control of …, 2023 - ieeexplore.ieee.org
Reinforcement learning is an attractive approach to learn good resource allocation and
scheduling policies based on data when the system model is unknown. However, the …

On two sensors scheduling for remote state estimation with a shared memory channel in a cyber-physical system environment

J Wei, D Ye - IEEE Transactions on Cybernetics, 2021 - ieeexplore.ieee.org
This article studies two sensors scheduling with a shared memory channel for remote state
estimation in cyber-physical systems (CPSs). We consider that each sensor monitors a plant …

User association in dense mmWave networks as restless bandits

SK Singh, VS Borkar… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In this article, we study the problem of user association, i.e., determining which base station
(BS) a user should associate with, in a dense millimeter wave (mmWave) network. In our …