A survey of multi-objective sequential decision-making

DM Roijers, P Vamplew, S Whiteson… - Journal of Artificial …, 2013 - jair.org
Sequential decision-making problems with multiple objectives arise naturally in practice and
pose unique challenges for research in decision-theoretic planning and learning, which has …

Safe reinforcement learning via shielding under partial observability

S Carr, N Jansen, S Junges, U Topcu - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Safe exploration is a common problem in reinforcement learning (RL) that aims to prevent
agents from making disastrous decisions while exploring their environment. A family of …

Bq-nco: Bisimulation quotienting for efficient neural combinatorial optimization

D Drakulic, S Michel, F Mai, A Sors… - Advances in Neural …, 2024 - proceedings.neurips.cc
Despite the success of neural-based combinatorial optimization methods for end-to-end
heuristic learning, out-of-distribution generalization remains a challenge. In this paper, we …

Simulated penetration testing: From" dijkstra" to" turing test++"

J Hoffmann - Proceedings of the international conference on …, 2015 - ojs.aaai.org
Penetration testing (pentesting) is a well established method for identifying security
weaknesses, by conducting friendly attacks. Simulated pentesting automates this process …

[图书][B] Multi-objective decision making

DM Roijers, S Whiteson, R Brachman, P Stone - 2017 - Springer
Many real-world decision problems have multiple objectives. For example, when choosing a
medical treatment plan, we want to maximize the efficacy of the treatment, but also minimize …

Probabilistic planning with formal performance guarantees for mobile service robots

B Lacerda, F Faruq, D Parker… - … International Journal of …, 2019 - journals.sagepub.com
We present a framework for mobile service robot task planning and execution, based on the
use of probabilistic verification techniques for the generation of optimal policies with …

Ethically compliant sequential decision making

J Svegliato, SB Nashed, S Zilberstein - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Enabling autonomous systems to comply with an ethical theory is critical given their
accelerating deployment in domains that impact society. While many ethical theories have …

Symbolic network: generalized neural policies for relational MDPs

S Garg, A Bajpai - International Conference on Machine …, 2020 - proceedings.mlr.press
Abstract A Relational Markov Decision Process (RMDP) is a first-order representation to
express all instances of a single probabilistic planning domain with possibly unbounded …

Safe path planning for UAV urban operation under GNSS signal occlusion risk

JA Delamer, Y Watanabe, CPC Chanel - Robotics and Autonomous …, 2021 - Elsevier
This paper introduces a concept of safe path planning for UAV's autonomous operation in an
urban environment where GNSS-positioning may become unreliable or even unavailable. If …

Relational abstractions for generalized reinforcement learning on symbolic problems

R Karia, S Srivastava - arXiv preprint arXiv:2204.12665, 2022 - arxiv.org
Reinforcement learning in problems with symbolic state spaces is challenging due to the
need for reasoning over long horizons. This paper presents a new approach that utilizes …