trajectory planning reinforcement learning- 学术资源搜索

过去一年中添加的文章，按日期排序

Optimal Robust Formation of Multi-Agent Systems as Adversarial Graphical Apprentice Games With Inverse Reinforcement Learning

FM Golmisheh, S Shamaghdari - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

6 天前 - … In this paper, we introduce an apprentice learning approach in which the learner
agents follow the formation and trajectory of the expert agents. The importance of maintaining …

[PDF] onomazein.com

Improved Aircraft Trajectory Language Planning Strategy Based on Soft Actor Critic

Y Dong, Y Wang - Onomázein, 2024 - onomazein.com

8 天前 - … an innovative trajectory planning algorithm for stratospheric airships, combining
Artificial Potential Field (APF) method with Soft Actor-Critic (SAC) deep reinforcement learning …

Age of Information Aware Trajectory Planning of UAV

J Pan, Y Li, R Chai, S Xia, L Zuo - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

9 天前 - … trajectory planning and resource allocation scheme based on hierarchical deep
reinforcement learning … sub-problems: high-level global trajectory planning and low-level local …

Deep reinforcement learning with predictive auxiliary task for autonomous train collision avoidance

A Plissonneau, L Jourdan, D Trentesaux, L Abdi… - … Rail Transport Planning …, 2024 - Elsevier

9 天前 - … consists of a deep reinforcement learning (DRL) based … a method for training a
reinforcement learning (RL) agent … to be predictive of obstacle trajectories. A comparison study …

[PDF][PDF] RoboGen: Autonomous Skill Acquisition in Robot Simulation Based on General Language Model and Reinforcement Learning

W Junyang, J Yibing, Z Jingdong, D Haowen… - me336.ancorasir.com

10 天前 - … RoboGen simultaneously employs motion planningbased primitives, gradient-based
trajectory optimization, and reinforcement learning to acquire skills. For each task, we …

Adaptive Distance Functions via Kelvin Transformation

RI Cabral Muchacho, FT Pokorny - arXiv e-prints, 2024 - ui.adsabs.harvard.edu

11 天前 - … to time-varying semantic information, and to perform queries in sub-microsecond,
enabling applications in reinforcement learning, trajectory optimization, and motion planning. …

Fast Collision-Free Multivehicle Lane Change Motion Planning and Control Framework in Uncertain Environments

T Liu, R Chai, S Chai, F Arvin, J Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

13 天前 - … , various trajectory planning … reinforcement learning algorithm based on an
end-to-end mapless collision avoidance algorithm for training in the proposed distributed learning …

Do Robots Dream of Random Trees? Monte Carlo Tree Search for Dynamical, Partially Observable, and Multi-Agent Systems

BP Rivière - 2024 - thesis.library.caltech.edu

13 天前 - … to generate intelligent planned behavior. Although reinforcement learning methods
that … gap by developing algorithms that compute trajectories in real-time while converging …

[PDF] openreview.net

Reward-Free Kernel-Based Reinforcement Learning

S Vakili, F Nabiei, D Shiu, A Bernacchia - … on Machine Learning - openreview.net

13 天前 - … This assumption simplifies the problem compared to the rewardfree RL framework
considered in this work, where the agent must follow the MDP trajectory within each episode …

[PDF] arxiv.org

Adaptive Distance Functions via Kelvin Transformation

RIC Muchacho, FT Pokorny - arXiv preprint arXiv:2406.03200, 2024 - arxiv.org

14 天前 - … to adapt to time-varying semantic information, and to perform queries in ≤ 1 µs,
enabling applications in reinforcement learning, trajectory optimization, and motion planning. …

高级搜索

QQ 群

Optimal Robust Formation of Multi-Agent Systems as Adversarial Graphical Apprentice Games With Inverse Reinforcement Learning

Improved Aircraft Trajectory Language Planning Strategy Based on Soft Actor Critic

Age of Information Aware Trajectory Planning of UAV

Deep reinforcement learning with predictive auxiliary task for autonomous train collision avoidance

[PDF][PDF] RoboGen: Autonomous Skill Acquisition in Robot Simulation Based on General Language Model and Reinforcement Learning

Adaptive Distance Functions via Kelvin Transformation

Fast Collision-Free Multivehicle Lane Change Motion Planning and Control Framework in Uncertain Environments

Do Robots Dream of Random Trees? Monte Carlo Tree Search for Dynamical, Partially Observable, and Multi-Agent Systems

Reward-Free Kernel-Based Reinforcement Learning

Adaptive Distance Functions via Kelvin Transformation

引用