Getting aligned on representational alignment

I Sucholutsky, L Muttenthaler, A Weller, A Peng… - arXiv preprint arXiv …, 2023 - arxiv.org
Biological and artificial information processing systems form representations that they can
use to categorize, reason, plan, navigate, and make decisions. How can we measure the …

Alignment with human representations supports robust few-shot learning

I Sucholutsky, T Griffiths - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Should we care whether AI systems have representations of the world that are similar to
those of humans? We provide an information-theoretic analysis that suggests that there …

Human uncertainty in concept-based ai systems

KM Collins, M Barker, M Espinosa Zarlenga… - Proceedings of the …, 2023 - dl.acm.org
Placing a human in the loop may help abate the risks of deploying AI systems in safety-
critical settings (eg, a clinician working with a medical AI system). However, mitigating risks …

Binary classification with confidence difference

W Wang, L Feng, Y Jiang, G Niu… - Advances in …, 2024 - proceedings.neurips.cc
Recently, learning with soft labels has been shown to achieve better performance than
learning with hard labels in terms of model generalization, calibration, and robustness …

Human-in-the-loop mixup

KM Collins, U Bhatt, W Liu, V Piratla… - Uncertainty in …, 2023 - proceedings.mlr.press
Aligning model representations to humans has been found to improve robustness and
generalization. However, such methods often focus on standard observational data …

Data for mathematical copilots: Better ways of presenting proofs for machine learning

S Frieder, J Bayer, KM Collins, J Berner… - arXiv preprint arXiv …, 2024 - arxiv.org
The suite of datasets commonly used to train and evaluate the mathematical capabilities of
AI-based mathematical copilots (primarily large language models) exhibit several …

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

KM Collins, N Kim, Y Bitton, V Rieser… - Proceedings of the …, 2024 - ojs.aaai.org
Human feedback plays a critical role in learning and refining reward models for text-to-
image generation, but the optimal form the feedback should take for learning an accurate …

Using Contrastive Learning with Generative Similarity to Learn Spaces that Capture Human Inductive Biases

R Marjieh, S Kumar, D Campbell, L Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Humans rely on strong inductive biases to learn from few examples and abstract useful
information from sensory data. Instilling such biases in machine learning models has been …

Revisiting Rogers' Paradox in the Context of Human-AI Interaction

KM Collins, U Bhatt, I Sucholutsky - arXiv preprint arXiv:2501.10476, 2025 - arxiv.org
Humans learn about the world, and how to act in the world, in many ways: from individually
conducting experiments to observing and reproducing others' behavior. Different learning …

Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People

DM Huang, P Van Rijn, I Sucholutsky, R Marjieh… - arXiv preprint arXiv …, 2024 - arxiv.org
Conversational tones--the manners and attitudes in which speakers communicate--are
essential to effective communication. Amidst the increasing popularization of Large …