Generating behaviorally diverse policies with latent diffusion models

S Hegde, S Batra, KR Zentner… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Recent progress in Quality Diversity Reinforcement Learning (QD-RL) has enabled
learning a collection of behaviorally diverse, high performing policies. However, these …

Density descent for diversity optimization

DH Lee, A Palaparthi, MC Fontaine, B Tjanaka… - Proceedings of the …, 2024 - dl.acm.org
Diversity optimization seeks to discover a set of solutions that elicit diverse features. Prior
work has proposed Novelty Search (NS), which, given a current set of solutions, seeks to …

Bayesian Optimisation for Quality Diversity Search with coupled descriptor functions

P Kent, A Gaier, JB Mouret… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Quality Diversity (QD) algorithms such as MAP-Elites are a class of optimisation techniques
that attempt to find many high performing points that all behave differently according to a …

Hyperppo: A scalable method for finding small policies for robotic control

S Hegde, Z Huang, GS Sukhatme - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Models with fewer parameters are necessary for the neural control of memory-limited,
performant robots. Finding these smaller neural network architectures can be time …

Adaptive generative adversarial maximum entropy inverse reinforcement learning

L Song, D Li, X Xu - Information Sciences, 2025 - Elsevier
Maximum entropy inverse reinforcement learning algorithms have been extensively studied
for learning rewards and optimizing policies using expert demonstrations. However, high …

Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration

X Yu, Z Wan, DM Bossens, Y Lyu, Q Guo… - arXiv preprint arXiv …, 2024 - arxiv.org
Learning diverse and high-performance behaviors from a limited set of demonstrations is a
grand challenge. Traditional imitation learning methods usually fail in this task because most …

Quality Diversity Imitation Learning

Z Wan, X Yu, DM Bossens, Y Lyu, Q Guo… - arXiv preprint arXiv …, 2024 - arxiv.org
Imitation learning (IL) has shown great potential in various applications, such as robot
control. However, traditional IL methods are usually designed to learn only one specific type …

Scaling Policy Gradient Quality-Diversity with Massive Parallelization via Behavioral Variations

K Mitsides, M Faldor, A Cully - arXiv preprint arXiv:2501.18723, 2025 - arxiv.org
Quality-Diversity optimization comprises a family of evolutionary algorithms aimed at
generating a collection of diverse and high-performing solutions. MAP-Elites (ME), a notable …

Skill-Conditioned Policy Optimization with Successor Features Representations

L Grillotti, M Faldor, BG León, A Cully - Second Agent Learning in …, 2023 - openreview.net
A key aspect of intelligence is the ability to exhibit a wide range of behaviors to adapt to
unforeseen situations. Designing artificial agents that are capable of showcasing a broad …

Quality Diversity for Robot Learning: Limitations and Future Directions

S Batra, B Tjanaka, S Nikolaidis… - Proceedings of the Genetic …, 2024 - dl.acm.org
Quality Diversity (QD) has shown great success in discovering high-performing, diverse
policies for robot skill learning. While current benchmarks have led to the development of …