Multi-agent Cooperative Area Coverage: A Two-stage Planning Approach Based on Reinforcement Learning

G Yuan, J Xiao, J He, H Jia, Y Wang, Z Wang - Information Sciences, 2024 - Elsevier
Multi-agent area coverage aims to accomplish the complete traversal of the target area
through cooperation between agents. Focusing on the problems of low coverage efficiency …

On-Policy deep reinforcement learning approach to multi agent problems

Z Tan, M Karakose - Interdisciplinary Research in Technology …, 2021 - taylorfrancis.com
Reinforcement learning approach has been preferred by researchers and scientists in
recent years, especially due to its superior performance in robot studies. While smart …

SEGAC: Sample Efficient Generalized Actor Critic for the Stochastic On-Time Arrival Problem

H Guo, Z He, W Sheng, Z Cao, Y Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
This paper studies the problem in transportation networks and introduces a novel
reinforcement learning-based algorithm, namely. Different from almost all canonical sota …

MEC-based jamming-aided anti-eavesdropping with deep reinforcement learning for WBANs

G Chen, X Liu, M Shorfuzzaman, A Karime… - ACM Transactions on …, 2021 - dl.acm.org
Wireless body area network (WBAN) suffers secure challenges, especially the
eavesdropping attack, due to constraint resources. In this article, deep reinforcement …

Navigation with time limits in transportation networks: A fourth moment approach

H Guo, Z He, C Gao, D Rus - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
This paper investigates the stochastic on-time arrival (SOTA) problem in transportation
networks. We propose a fourth moment approach (FMA), which calculates the tight lower …

[PDF][PDF] Comparative evaluation for effectiveness analysis of policy based deep reinforcement learning approaches

Z Tan, M Karaköse - International Journal of Computer …, 2021 - pdfs.semanticscholar.org
Deep Reinforcement Learning (DRL) has proven to be a very strong technique with results
in various applications in recent years. Especially the achievements in the studies in the field …

Optimized modeling and opportunity cost analysis for overloaded interconnected dangerous goods in warehouse operations

J Sun, F Zhang, P Lu, J Yee - Applied Mathematical Modelling, 2021 - Elsevier
Storage plays an important part in the field of logistics; however, conventional storage and
operation may result in an overloaded condition due to low management efficiency. To …

SRNN-RSA: a new method to solving time-dependent shortest path problems based on structural recurrent neural network and ripple spreading algorithm

S Yu, Y Song - Complex & Intelligent Systems, 2024 - Springer
Influenced by external factors, the speed of vehicles in the traffic network is changing all the
time, which makes the traditional static shortest route unable to meet the real logistics …

Solving pickup and drop-off problem using hybrid pointer networks with deep reinforcement learning

MG Alharbi, A Stohy, M Elhenawy, M Masoud… - Plos one, 2022 - journals.plos.org
In this study, we propose a general method for tackling the Pickup and Drop-off Problem
(PDP) using Hybrid Pointer Networks (HPNs) and Deep Reinforcement Learning (DRL). Our …

Variational Delayed Policy Optimization

Q Wu, SS Zhan, Y Wang, Y Wang, CW Lin, C Lv… - arXiv preprint arXiv …, 2024 - arxiv.org
In environments with delayed observation, state augmentation by including actions within
the delay window is adopted to retrieve Markovian property to enable reinforcement learning …