On the binding problem in artificial neural networks

K Greff, S Van Steenkiste, J Schmidhuber - arXiv preprint arXiv …, 2020 - arxiv.org
Contemporary neural networks still fall short of human-level generalization, which extends
far beyond our direct experiences. In this paper, we argue that the underlying cause for this …

Mindstorms in natural language-based societies of mind

M Zhuge, H Liu, F Faccio, DR Ashley… - arXiv preprint arXiv …, 2023 - arxiv.org
Both Minsky's" society of mind" and Schmidhuber's" learning to think" inspire diverse
societies of large multimodal neural networks (NNs) that solve problems by interviewing …

3d implicit transporter for temporally consistent keypoint discovery

C Zhong, Y Zheng, Y Zheng, H Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Keypoint-based representation has proven advantageous in various visual and robotic
tasks. However, the existing 2D and 3D methods for detecting keypoints mainly rely on …

Contrastive training of complex-valued autoencoders for object discovery

A Stanić, A Gopalakrishnan, K Irie… - Advances in Neural …, 2024 - proceedings.neurips.cc
Current state-of-the-art object-centric models use slots and attention-based routing for
binding. However, this class of models has several conceptual limitations: the number of …

Systematic visual reasoning through object-centric relational abstraction

T Webb, SS Mondal, JD Cohen - Advances in Neural …, 2024 - proceedings.neurips.cc
Human visual reasoning is characterized by an ability to identify abstract patterns from only
a small number of examples, and to systematically generalize those patterns to novel inputs …

Dance of SNN and ANN: solving binding problem by combining spike timing and reconstructive attention

H Zheng, H Lin, R Zhao, L Shi - Advances in Neural …, 2022 - proceedings.neurips.cc
The binding problem is one of the fundamental challenges that prevent the artificial neural
network (ANNs) from a compositional understanding of the world like human perception …

Vael: Bridging variational autoencoders and probabilistic logic programming

E Misino, G Marra, E Sansone - Advances in Neural …, 2022 - proceedings.neurips.cc
We present VAEL, a neuro-symbolic generative model integrating variational autoencoders
(VAE) with the reasoning capabilities of probabilistic logic (L) programming. Besides …

Self-supervised attention-aware reinforcement learning

H Wu, K Khetarpal, D Precup - Proceedings of the AAAI Conference on …, 2021 - ojs.aaai.org
Visual saliency has emerged as a major visualization tool for interpreting deep
reinforcement learning (RL) agents. However, much of the existing research uses it as an …

Associating objects and their effects in video through coordination games

E Lu, F Cole, W Xie, T Dekel… - Advances in …, 2022 - proceedings.neurips.cc
We explore a feed-forward approach for decomposing a video into layers, where each layer
contains an object of interest along with its associated shadows, reflections, and other visual …

Unsupervised image representation learning with deep latent particles

T Daniel, A Tamar - arXiv preprint arXiv:2205.15821, 2022 - arxiv.org
We propose a new representation of visual data that disentangles object position from
appearance. Our method, termed Deep Latent Particles (DLP), decomposes the visual input …