Transformers in reinforcement learning: a survey

P Agarwal, AA Rahman, PL St-Charles… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have significantly impacted domains like natural language processing,
computer vision, and robotics, where they improve performance compared to other neural …

Distributionally robust offline reinforcement learning with linear function approximation

X Ma, Z Liang, J Blanchet, M Liu, L Xia, J Zhang… - arXiv preprint arXiv …, 2022 - arxiv.org
Among the reasons hindering reinforcement learning (RL) applications to real-world
problems, two factors are critical: limited data and the mismatch between the testing …

Provably efficient offline reinforcement learning for partially observable markov decision processes

H Guo, Q Cai, Y Zhang, Z Yang… - … on Machine Learning, 2022 - proceedings.mlr.press
We study offline reinforcement learning (RL) for partially observable Markov decision
processes (POMDPs) with possibly infinite state and observation spaces. Under the …

An empirical study of implicit regularization in deep offline rl

C Gulcehre, S Srinivasan, J Sygnowski… - arXiv preprint arXiv …, 2022 - arxiv.org
Deep neural networks are the most commonly used function approximators in offline
reinforcement learning. Prior works have shown that neural nets trained with TD-learning …

Boosting offline reinforcement learning for autonomous driving with hierarchical latent skills

Z Li, F Nie, Q Sun, F Da, H Zhao - arXiv preprint arXiv:2309.13614, 2023 - arxiv.org
Learning-based vehicle planning is receiving increasing attention with the emergence of
diverse driving simulators and large-scale driving datasets. While offline reinforcement …

A model-based solution to the offline multi-agent reinforcement learning coordination problem

P Barde, J Foerster, D Nowrouzezahrai… - arXiv preprint arXiv …, 2023 - arxiv.org
Training multiple agents to coordinate is an essential problem with applications in robotics,
game theory, economics, and social sciences. However, most existing Multi-Agent …

Uncertainty-Aware Decision Transformer for Stochastic Driving Environments

Z Li, F Nie, Q Sun, F Da, H Zhao - arXiv preprint arXiv:2309.16397, 2023 - arxiv.org
Offline Reinforcement Learning (RL) has emerged as a promising framework for learning
policies without active interactions, making it especially appealing for autonomous driving …

Optimizing traffic control with model-based learning: A pessimistic approach to data-efficient policy inference

M Kunjir, S Chawla, S Chandrasekar, D Jay… - Proceedings of the 29th …, 2023 - dl.acm.org
Traffic signal control is an important problem in urban mobility with a significant potential for
economic and environmental impact. While there is a growing interest in Reinforcement …

Lyapunov stability regulation of deep reinforcement learning control with application to automated driving

B Hejase, U Ozguner - 2023 American Control Conference …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) control for nonlinear dynamical systems has seen increasing
interests in recent years. However, these methods have limited practical use due to the lack …

NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching

H Zhang, G Wang, X Wang, Z Zhou, C Zhang… - Proceedings of the …, 2024 - ojs.aaai.org
One of the most important tasks in ride-hailing is order dispatching, ie, assigning unserved
orders to available drivers. Recent order dispatching has achieved a significant …