J Tong, L Fu, Z Han - IEEE Transactions on Wireless …, 2022 - ieeexplore.ieee.org
Age-of-information (AoI) based minimization problems have been widely considered in Internet-of-Things (IoT) networks with the settings of multi-source single-channel systems …
In this paper, we propose an approximate relative value learning (ARVL) algorithm for non- parametric MDPs with continuous state space and finite actions and average reward …
H Li, S Shao, A Gupta - IEEE Control Systems Letters, 2021 - ieeexplore.ieee.org
In this letter, we establish the convergence of fitted value iteration and fitted Q-value iteration for continuous-state continuous-action Markov decision problems (MDPs) with state …
Recursive stochastic algorithms have gained significant attention in the recent past due to data-driven applications. Examples include stochastic gradient descent for solving large …
A Gupta, R Jain, P Glynn - IEEE Transactions on Automatic …, 2024 - ieeexplore.ieee.org
In many branches of engineering, Banach contraction mapping theorem is employed to establish the convergence of certain deterministic algorithms. Randomized versions of these …
H Sharma, R Jain - 2019 57th Annual Allerton Conference on …, 2019 - ieeexplore.ieee.org
It has long been a challenging problem to design algorithms for Markov decision processes (MDPs) with continuous states and actions that are provably approximately optimal and can …
X Yang, J Hu, JQ Hu - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org
Markov decision processes are widely used for modeling sequential decision-making problems under uncertainty. We propose an online algorithm for solving a class of average …
This dissertation studies decentralized multi-agent collision avoidance and reinforcement learning (RL) for Markov Decision Process (MDP) with state-dependent action constraints …