A kernel-based view of language model fine-tuning

S Malladi, A Wettig, D Yu, D Chen… - … on Machine Learning, 2023 - proceedings.mlr.press
It has become standard to solve NLP tasks by fine-tuning pre-trained language models
(LMs), especially in low-data settings. There is minimal theoretical understanding of …

Construction of hierarchical neural architecture search spaces based on context-free grammars

S Schrodi, D Stoll, B Ru… - Advances in …, 2024 - proceedings.neurips.cc
The discovery of neural architectures from simple building blocks is a long-standing goal of
Neural Architecture Search (NAS). Hierarchical search spaces are a promising step towards …

Efficient non-parametric optimizer search for diverse tasks

R Wang, Y Xiong, M Cheng… - Advances in Neural …, 2022 - proceedings.neurips.cc
Efficient and automated design of optimizers plays a crucial role in full-stack AutoML
systems. However, prior methods in optimizer search are often limited by their scalability …

Improving training stability for multitask ranking models in recommender systems

J Tang, Y Drori, D Chang, M Sathiamoorthy… - Proceedings of the 29th …, 2023 - dl.acm.org
Recommender systems play an important role in many content platforms. While most
recommendation research is dedicated to designing better models to improve user …

Towards discovering neural architectures from scratch

S Schrodi, D Stoll, B Ru, RS Sukthanker, T Brox… - 2022 - openreview.net
The discovery of neural architectures from scratch is the long-standing goal of Neural
Architecture Search (NAS). Searching over a wide spectrum of neural architectures can …

Structured mutation inspired by evolutionary theory enriches population performance and diversity

S Tiso, P Carvalho, N Lourenço, P Machado - arXiv preprint arXiv …, 2023 - arxiv.org
Grammar-Guided Genetic Programming (GGGP) employs a variety of insights from
evolutionary theory to autonomously design solutions for a given task. Recent insights from …

Biological insights on grammar-structured mutations improve fitness and diversity

S Tiso, P Carvalho, N Lourenço… - Proceedings of the Genetic …, 2023 - dl.acm.org
Grammar-Guided Genetic Programming (GGGP) employs a variety of concepts from
evolutionary theory to autonomously design solutions for a given task. Recent insights from …

Generalisable Agents for Neural Network Optimisation

K Tessera, CR Tilbury, S Abramowitz… - arXiv preprint arXiv …, 2023 - arxiv.org
Optimising deep neural networks is a challenging task due to complex training dynamics,
high computational requirements, and long training times. To address this difficulty, we …

Neural Optimizer Equation, Decay Function, and Learning Rate Schedule Joint Evolution

B Morgan, D Hougen - arXiv preprint arXiv:2404.06679, 2024 - arxiv.org
A major contributor to the quality of a deep learning model is the selection of the optimizer.
We propose a new dual-joint search space in the realm of neural optimizer search (NOS) …

Generalisable Agents for Neural Network Optimisation

CR Tilbury, S Abramowitz, RJ de Kock… - … 2023: Optimization for …, 2023 - openreview.net
Optimising deep neural networks is a challenging task due to complex training dynamics,
high computational requirements, and long training times. To address this difficulty, we …