Queueing network controls via deep reinforcement learning

SS Gill, M Golec, J Hu, M Xu, J Du, H Wu, GK Walia… - Cluster …, 2025 - Springer

Abstract Edge Artificial Intelligence (AI) incorporates a network of interconnected systems
and devices that receive, cache, process, and analyse data in close communication with the …

被引用次数：15 相关文章所有 10 个版本

[PDF] neurips.cc

Policy optimization for continuous reinforcement learning

H Zhao, W Tang, D Yao - Advances in Neural Information …, 2024 - proceedings.neurips.cc

We study reinforcement learning (RL) in the setting of continuous time and space, for an
infinite horizon with a discounted objective and the underlying dynamics driven by a …

被引用次数：18 相关文章所有 6 个版本

[PDF] mlr.press

Hindsight learning for mdps with exogenous inputs

SR Sinclair, FV Frujeri, CA Cheng… - International …, 2023 - proceedings.mlr.press

Many resource management problems require sequential decision-making under
uncertainty, where the only uncertainty affecting the decision outcomes are exogenous …

被引用次数：21 相关文章所有 7 个版本

[PDF] neurips.cc

Finite Sample Analysis of Average-Reward TD Learning and -Learning

S Zhang, Z Zhang, ST Maguluri - Advances in Neural …, 2021 - proceedings.neurips.cc

The focus of this paper is on sample complexity guarantees of average-reward
reinforcement learning algorithms, which are known to be more challenging to study than …

被引用次数：27 相关文章所有 7 个版本

[PDF] arxiv.org

Learning and information in stochastic networks and queues

N Walton, K Xu - Tutorials in Operations Research …, 2021 - pubsonline.informs.org

We review the role of information and learning in the stability and optimization of queueing
systems. In recent years, techniques from supervised learning, online learning, and …

被引用次数：35 相关文章所有 3 个版本

[PDF] neurips.cc

Sample efficient reinforcement learning in mixed systems through augmented samples and its applications to queueing networks

H Wei, X Liu, W Wang, L Ying - Advances in Neural …, 2024 - proceedings.neurips.cc

This paper considers a class of reinforcement learning problems, which involve systems with
two types of states: stochastic and pseudo-stochastic. In such systems, stochastic states …

被引用次数：11 相关文章所有 6 个版本

[PDF] arxiv.org

An online learning approach to dynamic pricing and capacity sizing in service systems

X Chen, Y Liu, G Hong - Operations Research, 2024 - pubsonline.informs.org

We study a dynamic pricing and capacity sizing problem in a GI/GI/1 queue, in which the
service provider's objective is to obtain the optimal service fee p and service capacity μ so as …

被引用次数：38 相关文章所有 6 个版本

[PDF] ssrn.com

Online learning and pricing for service systems with reusable resources

H Jia, C Shi, S Shen - Operations Research, 2024 - pubsonline.informs.org

We consider a price-based revenue management problem with finite reusable resources
over a finite time horizon T. Customers arrive following a price-dependent Poisson process …

被引用次数：37 相关文章所有 7 个版本

[PDF] neurips.cc

Bayesian learning of optimal policies in markov decision processes with countably infinite state-space

S Adler, V Subramanian - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Abstract Models of many real-life applications, such as queueing models of communication
networks or computing systems, have a countably infinite state-space. Algorithmic and …

被引用次数：5 相关文章所有 5 个版本

[PDF] ieee.org

Scalable deep reinforcement learning for ride-hailing

J Feng, M Gluzman, JG Dai - 2021 American Control …, 2021 - ieeexplore.ieee.org

Ride-hailing services, such as Didi Chuxing, Lyft, and Uber, arrange thousands of cars to
meet ride requests throughout the day. We consider a Markov decision process (MDP) …

被引用次数：28 相关文章所有 4 个版本

高级搜索

QQ 群