Meta-learning in neural networks: A survey

T Hospedales, A Antoniou, P Micaelli… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …

On implicit bias in overparameterized bilevel optimization

P Vicol, JP Lorraine, F Pedregosa… - International …, 2022 - proceedings.mlr.press
Many problems in machine learning involve bilevel optimization (BLO), including
hyperparameter optimization, meta-learning, and dataset distillation. Bilevel problems …

Unbiased gradient estimation in unrolled computation graphs with persistent evolution strategies

P Vicol, L Metz, J Sohl-Dickstein - … Conference on Machine …, 2021 - proceedings.mlr.press
Unrolled computation graphs arise in many scenarios, including training RNNs, tuning
hyperparameters through unrolled optimization, and training learned optimizers. Current …

Bidirectional learning for offline model-based biological sequence design

C Chen, Y Zhang, X Liu… - … Conference on Machine …, 2023 - proceedings.mlr.press
Offline model-based optimization aims to maximize a black-box objective function with a
static dataset of designs and their scores. In this paper, we focus on biological sequence …

Evograd: Efficient gradient-based meta-learning and hyperparameter optimization

O Bohdal, Y Yang… - Advances in neural …, 2021 - proceedings.neurips.cc
Gradient-based meta-learning and hyperparameter optimization have seen significant
progress recently, enabling practical end-to-end training of neural networks together with …

Online meta-critic learning for off-policy actor-critic methods

W Zhou, Y Li, Y Yang, H Wang… - Advances in neural …, 2020 - proceedings.neurips.cc
Abstract Off-Policy Actor-Critic (OffP-AC) methods have proven successful in a variety of
continuous control tasks. Normally, the critic's action-value function is updated using …

Continuous-time meta-learning with forward mode differentiation

T Deleu, D Kanaa, L Feng, G Kerg, Y Bengio… - arXiv preprint arXiv …, 2022 - arxiv.org
Drawing inspiration from gradient-based meta-learning methods with infinitely small
gradient steps, we introduce Continuous-Time Meta-Learning (COMLN), a meta-learning …

Low-variance gradient estimation in unrolled computation graphs with es-single

P Vicol - International Conference on Machine Learning, 2023 - proceedings.mlr.press
We propose an evolution strategies-based algorithm for estimating gradients in unrolled
computation graphs, called ES-Single. Similarly to the recently-proposed Persistent …

Amortized proximal optimization

J Bae, P Vicol, JZ HaoChen… - Advances in Neural …, 2022 - proceedings.neurips.cc
We propose a framework for online meta-optimization of parameters that govern
optimization, called Amortized Proximal Optimization (APO). We first interpret various …

Meta-Learning for Wireless Communications: A Survey and a Comparison to GNNs

B Zhao, J Wu, Y Ma, C Yang - IEEE Open Journal of the …, 2024 - ieeexplore.ieee.org
Deep learning has been used for optimizing a multitude of wireless problems. Yet most
existing works assume that training and test samples are drawn from the same distribution …