Decision-theoretic planning: Structural assumptions and computational leverage

MK Ho, R Saxe, F Cushman - Trends in Cognitive Sciences, 2022 - cell.com

Understanding Theory of Mind should begin with an analysis of the problems it solves. The
traditional answer is that Theory of Mind is used for predicting others' thoughts and actions …

Cited by 91 Related articles All 12 versions

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Cited by 297 Related articles All 2 versions

[PDF] tandfonline.com

[BOOK][B] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions: by Warren B. Powell (ed.), Wiley (2022). Hardback. ISBN …

I Halperin - 2022 - Taylor & Francis

What is reinforcement learning? How is reinforcement learning different from stochastic
optimization? And finally, can it be used for applications to quantitative finance for my current …

Cited by 160 Related articles All 6 versions

[PDF] fransoliehoek.net

[BOOK][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer

This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

Cited by 1213 Related articles All 13 versions

[PDF] thecvf.com

Learning to drive from a world on rails

D Chen, V Koltun, P Krähenbühl - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We learn an interactive vision-based driving policy from pre-recorded driving logs via a
model-based approach. A forward model of the world supervises a driving policy that …

Cited by 107 Related articles All 11 versions

[PDF] jair.org

A survey of multi-objective sequential decision-making

DM Roijers, P Vamplew, S Whiteson… - Journal of Artificial …, 2013 - jair.org

Sequential decision-making problems with multiple objectives arise naturally in practice and
pose unique challenges for research in decision-theoretic planning and learning, which has …

Cited by 751 Related articles All 21 versions

Statistical relational artificial intelligence: Logic, probability, and computation

LD Raedt, K Kersting, S Natarajan, D Poole - Synthesis lectures on …, 2016 - Springer

An intelligent agent interacting with the real world will encounter individual people, courses,
test results, drugs prescriptions, chairs, boxes, etc., and needs to reason about properties of …

Cited by 372 Related articles All 7 versions

[PDF] psl.eu

[BOOK][B] Probabilistic graphical models: principles and techniques

D Koller, N Friedman - 2009 - books.google.com

A general framework for constructing and using probabilistic models of complex systems that
would enable a computer to use available information for making decisions. Most tasks …

Cited by 11134 Related articles All 13 versions

[HTML] sciencedirect.com

[HTML] Deliberation for autonomous robots: A survey

F Ingrand, M Ghallab - Artificial Intelligence, 2017 - Elsevier

Autonomous robots facing a diversity of open environments and performing a variety of tasks
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …

Cited by 387 Related articles All 9 versions

[PDF] springer.com

On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer

In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …

Cited by 756 Related articles All 16 versions
