Offline reinforcement learning for autonomous driving with safety and exploration enhancement

P Agarwal, AA Rahman, PL St-Charles… - arXiv preprint arXiv …, 2023 - arxiv.org

Transformers have significantly impacted domains like natural language processing,
computer vision, and robotics, where they improve performance compared to other neural …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Distributionally robust offline reinforcement learning with linear function approximation

X Ma, Z Liang, J Blanchet, M Liu, L Xia, J Zhang… - arXiv preprint arXiv …, 2022 - arxiv.org

Among the reasons hindering reinforcement learning (RL) applications to real-world
problems, two factors are critical: limited data and the mismatch between the testing …

被引用次数：24 相关文章所有 2 个版本

[PDF] mlr.press

Provably efficient offline reinforcement learning for partially observable markov decision processes

H Guo, Q Cai, Y Zhang, Z Yang… - … on Machine Learning, 2022 - proceedings.mlr.press

We study offline reinforcement learning (RL) for partially observable Markov decision
processes (POMDPs) with possibly infinite state and observation spaces. Under the …

被引用次数：15 相关文章所有 5 个版本

[PDF] arxiv.org

An empirical study of implicit regularization in deep offline rl

C Gulcehre, S Srinivasan, J Sygnowski… - arXiv preprint arXiv …, 2022 - arxiv.org

Deep neural networks are the most commonly used function approximators in offline
reinforcement learning. Prior works have shown that neural nets trained with TD-learning …

被引用次数：13 相关文章所有 4 个版本

[PDF] arxiv.org

Boosting offline reinforcement learning for autonomous driving with hierarchical latent skills

Z Li, F Nie, Q Sun, F Da, H Zhao - arXiv preprint arXiv:2309.13614, 2023 - arxiv.org

Learning-based vehicle planning is receiving increasing attention with the emergence of
diverse driving simulators and large-scale driving datasets. While offline reinforcement …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

A model-based solution to the offline multi-agent reinforcement learning coordination problem

P Barde, J Foerster, D Nowrouzezahrai… - arXiv preprint arXiv …, 2023 - arxiv.org

Training multiple agents to coordinate is an essential problem with applications in robotics,
game theory, economics, and social sciences. However, most existing Multi-Agent …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Uncertainty-Aware Decision Transformer for Stochastic Driving Environments

Z Li, F Nie, Q Sun, F Da, H Zhao - arXiv preprint arXiv:2309.16397, 2023 - arxiv.org

Offline Reinforcement Learning (RL) has emerged as a promising framework for learning
policies without active interactions, making it especially appealing for autonomous driving …

被引用次数：1 相关文章所有 3 个版本

[PDF] acm.org

Optimizing traffic control with model-based learning: A pessimistic approach to data-efficient policy inference

M Kunjir, S Chawla, S Chandrasekar, D Jay… - Proceedings of the 29th …, 2023 - dl.acm.org

Traffic signal control is an important problem in urban mobility with a significant potential for
economic and environmental impact. While there is a growing interest in Reinforcement …

被引用次数：1 相关文章

Lyapunov stability regulation of deep reinforcement learning control with application to automated driving

B Hejase, U Ozguner - 2023 American Control Conference …, 2023 - ieeexplore.ieee.org

Reinforcement learning (RL) control for nonlinear dynamical systems has seen increasing
interests in recent years. However, these methods have limited practical use due to the lack …

被引用次数：3 相关文章

[PDF] aaai.org

NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching

H Zhang, G Wang, X Wang, Z Zhou, C Zhang… - Proceedings of the …, 2024 - ojs.aaai.org

One of the most important tasks in ride-hailing is order dispatching, ie, assigning unserved
orders to available drivers. Recent order dispatching has achieved a significant …

被引用次数：1 相关文章

高级搜索

QQ 群