Synthesis for robots: Guarantees and feedback for robot behavior

H Kress-Gazit, M Lahijanian… - Annual Review of Control …, 2018 - annualreviews.org
Robot control for tasks such as moving around obstacles or grasping objects has advanced
significantly in the last few decades. However, controlling robots to perform complex tasks is …

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

M Hasanbeig, Y Kantaros, A Abate… - 2019 IEEE 58th …, 2019 - ieeexplore.ieee.org
We present a model-free reinforcement learning algorithm to synthesize control policies that
maximize the probability of satisfying high-level control objectives given as Linear Temporal …

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

Learning-based probabilistic LTL motion planning with environment and motion uncertainties

M Cai, H Peng, Z Li, Z Kan - IEEE Transactions on Automatic …, 2020 - ieeexplore.ieee.org
This article considers control synthesis of an autonomous agent with linear temporal logic
(LTL) specifications subject to environment and motion uncertainties. Specifically, the …

Policy-based reinforcement learning for time series anomaly detection

M Yu, S Sun - Engineering Applications of Artificial Intelligence, 2020 - Elsevier
Time series anomaly detection has become a crucial and challenging task driven by the
rapid increase of streaming data with the arrival of the Internet of Things. Existing methods …

Reduced variance deep reinforcement learning with temporal logic specifications

Q Gao, D Hajinezhad, Y Zhang, Y Kantaros… - Proceedings of the 10th …, 2019 - dl.acm.org
In this paper, we propose a model-free reinforcement learning method to synthesize control
policies for mobile robots modeled as Markov Decision Process (MDP) with unknown …

Optimal probabilistic motion planning with potential infeasible LTL constraints

M Cai, S Xiao, Z Li, Z Kan - IEEE transactions on automatic …, 2021 - ieeexplore.ieee.org
This paper studies optimal motion planning subject to motion and environment uncertainties.
By modeling the system as a probabilistic labeled Markov decision process (PL-MDP), the …

Reinforcement learning based temporal logic control with maximum probabilistic satisfaction

M Cai, S Xiao, B Li, Z Li, Z Kan - 2021 IEEE international …, 2021 - ieeexplore.ieee.org
This paper presents a model-free reinforcement learning (RL) algorithm to synthesize a
control policy that maximizes the satisfaction probability of complex tasks, which are …

Formal policy synthesis for continuous-state systems via reinforcement learning

M Kazemi, S Soudjani - … Methods: 16th International Conference, IFM 2020 …, 2020 - Springer
This paper studies satisfaction of temporal properties on unknown stochastic processes that
have continuous state spaces. We show how reinforcement learning (RL) can be applied for …

Hierarchical motion planning under probabilistic temporal tasks and safe-return constraints

M Guo, T Liao, J Wang, Z Li - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org
Safety is crucial for robotic missions within an uncertain environment. Common safety
requirements such as collision avoidance are only state-dependent, which can be restrictive …