Skilled motor control of an inverted pendulum implies low entropy of states but high entropy of actions

N Catenacci Volpi, M Greaves… - PLoS Computational …, 2023 - journals.plos.org
The mastery of skills, such as balancing an inverted pendulum, implies a very accurate
control of movements to achieve the task goals. Traditional accounts of skilled action control …

[PDF][PDF] Beyond Surprise: Improving Exploration Through Surprise Novelty.

H Le, K Do, D Nguyen, S Venkatesh - AAMAS, 2024 - thaihungle.github.io
What motivates agents to explore? Successfully answering this question would enable
agents to learn efficiently in formidable tasks. Random explorations such as 𝜖-greedy are …

Intrinsic Motivation via Surprise Memory

H Le, K Do, D Nguyen, S Venkatesh - arXiv preprint arXiv:2308.04836, 2023 - arxiv.org
We present a new computing model for intrinsic rewards in reinforcement learning that
addresses the limitations of existing surprise-driven explorations. The reward is the novelty …

Learning Representations of Cognitive Dynamics and Decision Making in Human Drivers

R Wei - 2023 - search.proquest.com
In order to function safely and autonomously, modern robotic systems need to understand
other agents' mental states, including their beliefs and desires about the shared …

Skilled motor control implies a low entropy of states but a high entropy of actions

NC Volpi, M Greaves, D Trendafilov, C Salge… - arXiv preprint arXiv …, 2021 - arxiv.org
The mastery of skills such as playing tennis or balancing an inverted pendulum implies a
very accurate control of movements to achieve the task goals. Traditional accounts of skilled …

[图书][B] Building RL Algorithms that Generalize: From Latent Dynamics Models to Meta-Learning

JD Co-Reyes - 2021 - search.proquest.com
Building general purpose RL algorithms that can efficiently solve a wide variety of problems
will require encoding the right structure and representations into our models. A key …