Point-based value iteration for constrained POMDPs

Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

CP Andriotis, KG Papakonstantinou - Reliability Engineering & System …, 2021 - Elsevier

Determination of inspection and maintenance policies for minimizing long-term risks and
costs in deteriorating engineering environments constitutes a complex optimization problem …

被引用次数：142 相关文章所有 11 个版本

Online tree-based planning for active spacecraft fault estimation and collision avoidance

J Ragan, B Riviere, FY Hadaegh, SJ Chung - Science Robotics, 2024 - science.org

Autonomous robots operating in uncertain or hazardous environments subject to state safety
constraints must be able to identify and isolate faulty components in a time-optimal manner …

被引用次数：2 相关文章所有 3 个版本

[PDF] aaai.org

Learning with safety constraints: Sample complexity of reinforcement learning for constrained mdps

A HasanzadeZonuzy, A Bura, D Kalathil… - Proceedings of the …, 2021 - ojs.aaai.org

Many physical systems have underlying safety considerations that require that the policy
employed ensures the satisfaction of a set of constraints. The analytical formulation usually …

被引用次数：48 相关文章所有 7 个版本

[PDF] neurips.cc

Monte-Carlo tree search for constrained POMDPs

J Lee, GH Kim, P Poupart… - Advances in Neural …, 2018 - proceedings.neurips.cc

Abstract Monte-Carlo Tree Search (MCTS) has been successfully applied to very large
POMDPs, a standard model for stochastic sequential decision-making problems. However …

被引用次数：78 相关文章所有 10 个版本

[PDF] aaai.org

Rao*: An algorithm for chance-constrained pomdp's

S Thiébaux, B Williams - Proceedings of the AAAI Conference on …, 2016 - ojs.aaai.org

Autonomous agents operating in partially observable stochastic environments often face the
problem of optimizing expected performance while bounding the risk of violating safety …

被引用次数：95 相关文章所有 13 个版本

[PDF] jair.org Full View

Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms

F De Nijs, E Walraven, M De Weerdt, M Spaan - Journal of Artificial …, 2021 - jair.org

In domains such as electric vehicle charging, smart distribution grids and autonomous
warehouses, multiple agents share the same resources. When planning the use of these …

被引用次数：38 相关文章所有 12 个版本

[PDF] aaai.org

Approximate linear programming for constrained partially observable Markov decision processes

P Poupart, A Malhotra, P Pei, KE Kim, B Goh… - Proceedings of the …, 2015 - ojs.aaai.org

In many situations, it is desirable to optimize a sequence of decisions by maximizing a
primary objective while respecting some constraints with respect to secondary objectives …

被引用次数：81 相关文章所有 15 个版本

[PDF] aaai.org

Online planning for constrained POMDPs with continuous spaces through dual ascent

A Jamgochian, A Corso, MJ Kochenderfer - Proceedings of the …, 2023 - ojs.aaai.org

Rather than augmenting rewards with penalties for undesired behavior, Constrained
Partially Observable Markov Decision Processes (CPOMDPs) plan safely by imposing …

被引用次数：12 相关文章所有 5 个版本

Cognitive radar for classification

S Brüggenwirth, M Warnke, S Wagner… - IEEE Aerospace and …, 2019 - ieeexplore.ieee.org

The article presents the cognitive radar architecture of Fraunhofer FHR based on a three-
layer model of human cognitive performance. The approach is illustrated using examples for …

被引用次数：37 相关文章所有 2 个版本

[PDF] openreview.net

Safe reinforcement learning from pixels using a stochastic latent representation

Y Hogewind, TD Simao, T Kachman… - … Conference on Learning …, 2022 - openreview.net

We address the problem of safe reinforcement learning from pixel observations. Inherent
challenges in such settings are (1) a trade-off between reward optimization and adhering to …

被引用次数：16 相关文章所有 5 个版本

高级搜索

QQ 群