Language model alignment with elastic reset

M Noukhovitch, S Lavoie, F Strub… - Advances in Neural …, 2024 - proceedings.neurips.cc
Finetuning language models with reinforcement learning (RL), eg from human feedback
(HF), is a prominent method for alignment. But optimizing against a reward model can …

A review of the applications of deep learning-based emergent communication

B Boldt, D Mortensen - arXiv preprint arXiv:2407.03302, 2024 - arxiv.org
Emergent communication, or emergent language, is the field of research which studies how
human language-like communication systems emerge de novo in deep multi-agent …

A Survey on Emergent Language

J Peters, CW de Puiseau, H Tercan… - arXiv preprint arXiv …, 2024 - arxiv.org
The field of emergent language represents a novel area of research within the domain of
artificial intelligence, particularly within the context of multi-agent reinforcement learning …

The curious case of representational alignment: Unravelling visio-linguistic tasks in emergent communication

T Kouwenhoven, M Peeperkorn, B Van Dijk… - arXiv preprint arXiv …, 2024 - arxiv.org
Natural language has the universal properties of being compositional and grounded in
reality. The emergence of linguistic properties is often investigated through simulations of …

Accelerating language emergence by functional pressures

K Vithanage, R Wijesinghe, A Xavier, D Tissera… - Plos one, 2023 - journals.plos.org
In language emergence, neural agents acquire communication skills by interacting with one
another and the environment. Through these interactions, agents learn to connect or ground …

Learning to translate by learning to communicate

CM Downey, X Zhou, LZ Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
We formulate and test a technique to use Emergent Communication (EC) with a pre-trained
multilingual model to improve on modern Unsupervised NMT systems, especially for low …

What does Kiki look like? Cross-modal associations between speech sounds and visual shapes in vision-and-language models

T Verhoef, K Shahrasbi, T Kouwenhoven - arXiv preprint arXiv:2407.17974, 2024 - arxiv.org
Humans have clear cross-modal preferences when matching certain novel words to visual
shapes. Evidence suggests that these preferences play a prominent role in our linguistic …

Speech Self-Supervised Learning Using Diffusion Model Synthetic Data

H Gao, K Qian, J Ni, C Gan… - Forty-first International … - openreview.net
While self-supervised learning (SSL) in speech has greatly reduced the reliance of speech
processing systems on annotated corpora, the success of SSL still hinges on the availability …

Interpretation Errors: Extracting Functionality From Generative Models of Language by Understanding Them Better

A Holtzman - 2023 - search.proquest.com
The rise of large language models as the workhorse of NLP, and the continuous release of
better models (OpenAI, 2023; Pichai, 2023; Schulman et al., 2022, inter alia) has created a …

Unsupervised speech technology for low-resource languages

H Gao - 2024 - ideals.illinois.edu
Deep neural network based speech processing systems have found widespread
applications in daily life, being employed for tasks such as automatic speech recognition …