A survey of inverse reinforcement learning: Challenges, methods and progress

X Wu, L Xiao, Y Sun, J Zhang, T Ma, L He - Future Generation Computer …, 2022 - Elsevier

Abstract Machine learning has become the state-of-the-art technique for many tasks
including computer vision, natural language processing, speech processing tasks, etc …

被引用次数：635 相关文章所有 6 个版本

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

被引用次数：220 相关文章所有 3 个版本

[PDF] jair.org Full View

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org

The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

被引用次数：408 相关文章所有 9 个版本

[PDF] acm.org

Five facets of 6G: Research challenges and opportunities

LH Shen, KT Feng, L Hanzo - ACM Computing Surveys, 2023 - dl.acm.org

While the fifth-generation systems are being rolled out across the globe, researchers have
turned their attention to the exploration of radical next-generation solutions. At this early …

被引用次数：124 相关文章所有 7 个版本

[PDF] nsf.gov

In situ bidirectional human-robot value alignment

L Yuan, X Gao, Z Zheng, M Edmonds, YN Wu… - Science robotics, 2022 - science.org

A prerequisite for social coordination is bidirectional communication between teammates,
each playing two roles simultaneously: as receptive listeners and expressive speakers. For …

被引用次数：89 相关文章所有 6 个版本

Reinforcement learning for intelligent healthcare applications: A survey

A Coronato, M Naeem, G De Pietro… - Artificial intelligence in …, 2020 - Elsevier

Discovering new treatments and personalizing existing ones is one of the major goals of
modern clinical research. In the last decade, Artificial Intelligence (AI) has enabled the …

被引用次数：320 相关文章所有 5 个版本

[PDF] arxiv.org

A review of cooperative multi-agent deep reinforcement learning

A Oroojlooy, D Hajinezhad - Applied Intelligence, 2023 - Springer

Abstract Deep Reinforcement Learning has made significant progress in multi-agent
systems in recent years. The aim of this review article is to provide an overview of recent …

被引用次数：511 相关文章所有 8 个版本

Transferring policy of deep reinforcement learning from simulation to reality for robotics

H Ju, R Juan, R Gomez, K Nakamura… - Nature Machine …, 2022 - nature.com

Deep reinforcement learning has achieved great success in many fields and has shown
promise in learning robust skills for robot control in recent years. However, sampling …

被引用次数：76 相关文章所有 2 个版本

[PDF] harvard.edu

Deep reinforcement learning for transportation network combinatorial optimization: A survey

Q Wang, C Tang - Knowledge-Based Systems, 2021 - Elsevier

Traveling salesman and vehicle routing problems with their variants, as classic
combinatorial optimization problems, have attracted considerable attention for decades of …

被引用次数：127 相关文章所有 4 个版本

[PDF] mlr.press

Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations

D Brown, W Goo, P Nagarajan… - … conference on machine …, 2019 - proceedings.mlr.press

A critical flaw of existing inverse reinforcement learning (IRL) methods is their inability to
significantly outperform the demonstrator. This is because IRL typically seeks a reward …

被引用次数：434 相关文章所有 12 个版本

高级搜索

QQ 群