Temporal logic motion control using actor–critic methods

H Kress-Gazit, M Lahijanian… - Annual Review of Control …, 2018 - annualreviews.org

Robot control for tasks such as moving around obstacles or grasping objects has advanced
significantly in the last few decades. However, controlling robots to perform complex tasks is …

被引用次数：178 相关文章所有 6 个版本

[PDF] arxiv.org

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

M Hasanbeig, Y Kantaros, A Abate… - 2019 IEEE 58th …, 2019 - ieeexplore.ieee.org

We present a model-free reinforcement learning algorithm to synthesize control policies that
maximize the probability of satisfying high-level control objectives given as Linear Temporal …

被引用次数：137 相关文章所有 14 个版本

[PDF] arxiv.org

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

被引用次数：81 相关文章所有 15 个版本

[PDF] github.io

Learning-based probabilistic LTL motion planning with environment and motion uncertainties

M Cai, H Peng, Z Li, Z Kan - IEEE Transactions on Automatic …, 2020 - ieeexplore.ieee.org

This article considers control synthesis of an autonomous agent with linear temporal logic
(LTL) specifications subject to environment and motion uncertainties. Specifically, the …

被引用次数：54 相关文章所有 5 个版本

[PDF] researchgate.net

Policy-based reinforcement learning for time series anomaly detection

M Yu, S Sun - Engineering Applications of Artificial Intelligence, 2020 - Elsevier

Time series anomaly detection has become a crucial and challenging task driven by the
rapid increase of streaming data with the arrival of the Internet of Things. Existing methods …

被引用次数：60 相关文章所有 2 个版本

[PDF] researchgate.net

Reduced variance deep reinforcement learning with temporal logic specifications

Q Gao, D Hajinezhad, Y Zhang, Y Kantaros… - Proceedings of the 10th …, 2019 - dl.acm.org

In this paper, we propose a model-free reinforcement learning method to synthesize control
policies for mobile robots modeled as Markov Decision Process (MDP) with unknown …

被引用次数：55 相关文章所有 5 个版本

[PDF] arxiv.org

Optimal probabilistic motion planning with potential infeasible LTL constraints

M Cai, S Xiao, Z Li, Z Kan - IEEE transactions on automatic …, 2021 - ieeexplore.ieee.org

This paper studies optimal motion planning subject to motion and environment uncertainties.
By modeling the system as a probabilistic labeled Markov decision process (PL-MDP), the …

被引用次数：35 相关文章所有 11 个版本

[PDF] arxiv.org

Reinforcement learning based temporal logic control with maximum probabilistic satisfaction

M Cai, S Xiao, B Li, Z Li, Z Kan - 2021 IEEE international …, 2021 - ieeexplore.ieee.org

This paper presents a model-free reinforcement learning (RL) algorithm to synthesize a
control policy that maximizes the satisfaction probability of complex tasks, which are …

被引用次数：32 相关文章所有 12 个版本

[PDF] arxiv.org

Formal policy synthesis for continuous-state systems via reinforcement learning

M Kazemi, S Soudjani - … Methods: 16th International Conference, IFM 2020 …, 2020 - Springer

This paper studies satisfaction of temporal properties on unknown stochastic processes that
have continuous state spaces. We show how reinforcement learning (RL) can be applied for …

被引用次数：32 相关文章所有 4 个版本

[PDF] arxiv.org

Hierarchical motion planning under probabilistic temporal tasks and safe-return constraints

M Guo, T Liao, J Wang, Z Li - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org

Safety is crucial for robotic missions within an uncertain environment. Common safety
requirements such as collision avoidance are only state-dependent, which can be restrictive …

被引用次数：3 相关文章所有 4 个版本

高级搜索

QQ 群