Risk aware and multi-objective decision making with distributional monte carlo tree search

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer

Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

被引用次数：265 相关文章所有 21 个版本

[PDF] arxiv.org

Sample-efficient multi-objective learning via generalized policy improvement prioritization

LN Alegre, ALC Bazzan, DM Roijers, A Nowé… - arXiv preprint arXiv …, 2023 - arxiv.org

Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision
problems where agents may have different preferences over (possibly conflicting) reward …

被引用次数：16 相关文章所有 11 个版本

[PDF] ifaamas.org

[PDF][PDF] Distributional monte carlo tree search for risk-aware and multi-objective reinforcement learning

CF Hayes, M Reymond, DM Roijers, E Howley… - Proceedings of the 20th …, 2021 - ifaamas.org

In many risk-aware and multi-objective reinforcement learning settings, the utility of the user
is derived from the single execution of a policy. In these settings, making decisions based on …

被引用次数：24 相关文章所有 7 个版本

[HTML] springer.com

[HTML][HTML] Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning

CF Hayes, M Reymond, DM Roijers, E Howley… - Autonomous Agents and …, 2023 - Springer

In many risk-aware and multi-objective reinforcement learning settings, the utility of the user
is derived from a single execution of a policy. In these settings, making decisions based on …

被引用次数：7 相关文章所有 7 个版本

[HTML] springer.com

[HTML][HTML] Expected scalarised returns dominance: A new solution concept for multi-objective decision making

CF Hayes, T Verstraeten, DM Roijers, E Howley… - Neural Computing and …, 2022 - Springer

In many real-world scenarios, the utility of a user is derived from a single execution of a
policy. In this case, to apply multi-objective reinforcement learning, the expected utility of the …

被引用次数：16 相关文章所有 6 个版本

[PDF] liv.ac.uk

[PDF][PDF] Decision-theoretic planning for the expected scalarised returns

CF Hayes, DM Roijers, E Howley… - Proceedings of the 21st …, 2022 - ifmas.csc.liv.ac.uk

In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …

被引用次数：6 相关文章所有 8 个版本

[PDF] arxiv.org

Multi-objective coordination graphs for the expected scalarised returns with generative flow models

CF Hayes, T Verstraeten, DM Roijers, E Howley… - arXiv preprint arXiv …, 2022 - arxiv.org

Many real-world problems contain multiple objectives and agents, where a trade-off exists
between objectives. Key to solving such problems is to exploit sparse dependency …

被引用次数：4 相关文章所有 4 个版本

[PDF] researchgate.net

[PDF][PDF] Multi-objective distributional value iteration

CF Hayes, DM Roijers, E Howley… - Adaptive and Learning …, 2022 - researchgate.net

In sequential multi-objective decision making (MODeM) settings, when the utility of a user is
derived from a single execution of a policy, policies for the expected scalarised returns …

被引用次数：4 相关文章所有 2 个版本

[PDF] universityofgalway.ie

[PDF][PDF] Multi-objective decision making for trustworthy ai

P Mannion, F Heintz… - … of the Multi …, 2021 - modem2021.cs.universityofgalway …

If widespread deployment of AI systems is to be accepted by society in the future, it is crucial
that such systems are trustworthy. Trustworthiness for autonomous systems has a number of …

被引用次数：6 相关文章

Hybrid Surrogate Assisted Evolutionary Multiobjective Reinforcement Learning for Continuous Robot Control

A Mazumdar, V Kyrki - International Conference on the Applications of …, 2024 - Springer

Many real world reinforcement learning (RL) problems consist of multiple conflicting
objective functions that need to be optimized simultaneously. Finding these optimal policies …

被引用次数：1 相关文章所有 4 个版本

高级搜索

QQ 群