- 学术资源搜索

A survey of multi-objective sequential decision-making

DM Roijers, P Vamplew, S Whiteson… - Journal of Artificial …, 2013 - jair.org

Sequential decision-making problems with multiple objectives arise naturally in practice and
pose unique challenges for research in decision-theoretic planning and learning, which has …

被引用次数：804 相关文章所有 21 个版本

[PDF] aaai.org

Safe reinforcement learning via shielding under partial observability

S Carr, N Jansen, S Junges, U Topcu - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Safe exploration is a common problem in reinforcement learning (RL) that aims to prevent
agents from making disastrous decisions while exploring their environment. A family of …

被引用次数：43 相关文章所有 8 个版本

[PDF] neurips.cc

Bq-nco: Bisimulation quotienting for efficient neural combinatorial optimization

D Drakulic, S Michel, F Mai, A Sors… - Advances in Neural …, 2024 - proceedings.neurips.cc

Despite the success of neural-based combinatorial optimization methods for end-to-end
heuristic learning, out-of-distribution generalization remains a challenge. In this paper, we …

被引用次数：25 相关文章所有 5 个版本

[PDF] aaai.org

Simulated penetration testing: From" dijkstra" to" turing test++"

J Hoffmann - Proceedings of the international conference on …, 2015 - ojs.aaai.org

Penetration testing (pentesting) is a well established method for identifying security
weaknesses, by conducting friendly attacks. Simulated pentesting automates this process …

被引用次数：145 相关文章所有 8 个版本

[PDF] ox.ac.uk

[图书][B] Multi-objective decision making

DM Roijers, S Whiteson, R Brachman, P Stone - 2017 - Springer

Many real-world decision problems have multiple objectives. For example, when choosing a
medical treatment plan, we want to maximize the efficacy of the treatment, but also minimize …

被引用次数：114 相关文章所有 10 个版本

[PDF] sagepub.com

Probabilistic planning with formal performance guarantees for mobile service robots

B Lacerda, F Faruq, D Parker… - … International Journal of …, 2019 - journals.sagepub.com

We present a framework for mobile service robot task planning and execution, based on the
use of probabilistic verification techniques for the generation of optimal policies with …

被引用次数：79 相关文章所有 17 个版本

[PDF] aaai.org

Ethically compliant sequential decision making

J Svegliato, SB Nashed, S Zilberstein - Proceedings of the AAAI …, 2021 - ojs.aaai.org

Enabling autonomous systems to comply with an ethical theory is critical given their
accelerating deployment in domains that impact society. While many ethical theories have …

被引用次数：45 相关文章所有 7 个版本

[PDF] mlr.press

Symbolic network: generalized neural policies for relational MDPs

S Garg, A Bajpai - International Conference on Machine …, 2020 - proceedings.mlr.press

Abstract A Relational Markov Decision Process (RMDP) is a first-order representation to
express all instances of a single probabilistic planning domain with possibly unbounded …

被引用次数：55 相关文章所有 7 个版本

[PDF] sciencedirect.com

Safe path planning for UAV urban operation under GNSS signal occlusion risk

JA Delamer, Y Watanabe, CPC Chanel - Robotics and Autonomous …, 2021 - Elsevier

This paper introduces a concept of safe path planning for UAV's autonomous operation in an
urban environment where GNSS-positioning may become unreliable or even unavailable. If …

被引用次数：33 相关文章所有 6 个版本

[PDF] arxiv.org

Relational abstractions for generalized reinforcement learning on symbolic problems

R Karia, S Srivastava - arXiv preprint arXiv:2204.12665, 2022 - arxiv.org

Reinforcement learning in problems with symbolic state spaces is challenging due to the
need for reasoning over long horizons. This paper presents a new approach that utilizes …

被引用次数：21 相关文章所有 8 个版本

高级搜索

QQ 群

A survey of multi-objective sequential decision-making

Safe reinforcement learning via shielding under partial observability

Bq-nco: Bisimulation quotienting for efficient neural combinatorial optimization

Simulated penetration testing: From" dijkstra" to" turing test++"

[图书][B] Multi-objective decision making

Probabilistic planning with formal performance guarantees for mobile service robots

Ethically compliant sequential decision making

Symbolic network: generalized neural policies for relational MDPs

Safe path planning for UAV urban operation under GNSS signal occlusion risk

Relational abstractions for generalized reinforcement learning on symbolic problems

引用