Digital twin studies for reverse engineering the origins of visual intelligence

JN Wood, L Pandey, SMW Wood - Annual Review of Vision …, 2024 - annualreviews.org
What are the core learning algorithms in brains? Nativists propose that intelligence emerges
from innate domain-specific knowledge systems, whereas empiricists propose that …

Beyond learnability: understanding human visual development with DNNs

L Yuan - Trends in cognitive sciences, 2024 - cell.com
Abstract Recently, Orhan and Lake demonstrated the computational plausibility that children
can acquire sophisticated visual representations from natural input data without inherent …

Is Child-Directed Speech Effective Training Data for Language Models?

SY Feng, ND Goodman, MC Frank - arXiv preprint arXiv:2408.03617, 2024 - arxiv.org
While high-performing language models are typically trained on hundreds of billions of
words, human children become fluent language users with a much smaller amount of data …

Parallel development of social behavior in biological and artificial fish

JD McGraw, D Lee, JN Wood - Nature Communications, 2024 - nature.com
Our algorithmic understanding of vision has been revolutionized by a reverse engineering
paradigm that involves building artificial systems that perform the same tasks as biological …

From Prototypes to General Distributions: An Efficient Curriculum for Masked Image Modeling

J Lin, CE Wu, H Li, J Zhang, YH Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Masked Image Modeling (MIM) has emerged as a powerful self-supervised learning
paradigm for visual representation learning, enabling models to acquire rich visual …

Human Gaze Boosts Object-Centered Representation Learning

T Schaumlöffel, A Aubret, G Roig, J Triesch - arXiv preprint arXiv …, 2025 - arxiv.org
Recent self-supervised learning (SSL) models trained on human-like egocentric visual
inputs substantially underperform on image recognition tasks compared to humans. These …

The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences

B Long, V Xiang, S Stojanov, RZ Sparks, Z Yin… - arXiv preprint arXiv …, 2024 - arxiv.org
Human children far exceed modern machine learning algorithms in their sample efficiency,
achieving high performance in key domains with much less data than current models …

Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning

X Ke, S Tsutsui, Y Zhang, B Wen - arXiv preprint arXiv:2501.05205, 2025 - arxiv.org
Infants develop complex visual understanding rapidly, even preceding of the acquisition of
linguistic inputs. As computer vision seeks to replicate the human vision system …

Active Gaze Behavior Boosts Self-Supervised Object Learning

Z Yu, A Aubret, MC Raabe, J Yang, C Yu… - arXiv preprint arXiv …, 2024 - arxiv.org
Due to significant variations in the projection of the same object from different viewpoints,
machine learning algorithms struggle to recognize the same object across various …

Learning Object Semantic Similarity with Self-Supervision

A Aubret, T Schaumlöffel, G Roig, J Triesch - arXiv preprint arXiv …, 2024 - arxiv.org
Humans judge the similarity of two objects not just based on their visual appearance but
also based on their semantic relatedness. However, it remains unclear how humans learn …