Observational learning by reinforcement learning

[HTML][HTML] Reward is enough

D Silver, S Singh, D Precup, RS Sutton - Artificial Intelligence, 2021 - Elsevier

In this article we hypothesise that intelligence, and its associated abilities, can be
understood as subserving the maximisation of reward. Accordingly, reward is enough to …

被引用次数：574 相关文章所有 8 个版本

[PDF] arxiv.org

Machine culture

L Brinkmann, F Baumann, JF Bonnefon… - Nature Human …, 2023 - nature.com

The ability of humans to create and disseminate culture is often credited as the single most
important factor of our success as a species. In this Perspective, we explore the notion of …

被引用次数：47 相关文章所有 22 个版本

[PDF] arxiv.org

A social path to human-like artificial intelligence

EA Duéñez-Guzmán, S Sadedin, JX Wang… - Nature Machine …, 2023 - nature.com

Traditionally, cognitive and computer scientists have viewed intelligence solipsistically, as a
property of unitary agents devoid of social context. Given the success of contemporary …

被引用次数：18 相关文章所有 4 个版本

[PDF] nature.com

Learning few-shot imitation as cultural transmission

A Bhoopchand, B Brownfield, A Collister… - Nature …, 2023 - nature.com

Cultural transmission is the domain-general social skill that allows agents to acquire and
use information from each other in real-time with high fidelity and recall. It can be thought of …

被引用次数：9 相关文章所有 9 个版本

[PDF] mlr.press

Emergent social learning via multi-agent reinforcement learning

KK Ndousse, D Eck, S Levine… - … conference on machine …, 2021 - proceedings.mlr.press

Social learning is a key component of human and animal intelligence. By taking cues from
the behavior of experts in their environment, social learners can acquire sophisticated …

被引用次数：64 相关文章所有 6 个版本

[PDF] arxiv.org

Imitating interactive intelligence

J Abramson, A Ahuja, I Barr, A Brussee… - arXiv preprint arXiv …, 2020 - arxiv.org

A common vision from science fiction is that robots will one day inhabit our physical spaces,
sense the world as we do, assist our physical labours, and communicate with us through …

被引用次数：69 相关文章所有 2 个版本

[PDF] mlr.press

Conditioned reinforcement learning for few-shot imitation

CR Dance, J Perez, T Cachet - International Conference on …, 2021 - proceedings.mlr.press

In few-shot imitation, an agent is given a few demonstrations of a previously unseen task,
and must then successfully perform that task. We propose a novel approach to learning few …

被引用次数：21 相关文章所有 5 个版本

[PDF] mlr.press

Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning

A Filos, C Lyle, Y Gal, S Levine… - International …, 2021 - proceedings.mlr.press

We study reinforcement learning (RL) with no-reward demonstrations, a setting in which an
RL agent has access to additional data from the interaction of other agents with the same …

被引用次数：29 相关文章所有 6 个版本

[PDF] iospress.com

Reinforcement learning for personalization: A systematic literature review

F Den Hengst, EM Grua, A el Hassouni… - Data …, 2020 - content.iospress.com

The major application areas of reinforcement learning (RL) have traditionally been game
playing and continuous control. In recent years, however, RL has been increasingly applied …

被引用次数：45 相关文章所有 13 个版本

[HTML] sciencedirect.com

[HTML][HTML] Fast and accurate data-driven goal recognition using process mining techniques

Z Su, A Polyvyanyy, N Lipovetzky, S Sardiña… - Artificial Intelligence, 2023 - Elsevier

The problem of goal recognition requests to automatically infer an accurate probability
distribution over possible goals an autonomous agent is attempting to achieve in the …

被引用次数：6 相关文章所有 5 个版本

高级搜索

QQ 群