Geometric multimodal contrastive representation learning

P Poklukar, M Vasco, H Yin, FS Melo… - International …, 2022 - proceedings.mlr.press
Learning representations of multimodal data that are both informative and robust to missing
modalities at test time remains a challenging problem due to the inherent heterogeneity of …

COM: Contrastive Masked-attention model for incomplete multimodal learning

S Qian, C Wang - Neural Networks, 2023 - Elsevier
Most multimodal learning methods assume that all modalities are always available in data.
However, in real-world applications, the assumption is often violated due to privacy …

MMVAE+: Enhancing the generative quality of multimodal VAEs without compromises

E Palumbo, I Daunhawer… - The Eleventh …, 2023 - research-collection.ethz.ch
Multimodal VAEs have recently gained attention as efficient models for weakly-supervised
generative learning with multiple modalities. However, all existing variants of multimodal …

MissModal: Increasing Robustness to Missing Modality in Multimodal Sentiment Analysis

R Lin, H Hu - Transactions of the Association for Computational …, 2023 - direct.mit.edu
When applying multimodal machine learning in downstream inference, both joint and
coordinated multimodal representations rely on the complete presence of modalities as in …

Multi-modal latent diffusion

M Bounoua, G Franzese, P Michiardi - Entropy, 2024 - mdpi.com
Multimodal datasets are ubiquitous in modern applications, and multimodal Variational
Autoencoders are a popular family of models that aim to learn a joint representation of …

Adapt and explore: Multimodal mixup for representation learning

R Lin, H Hu - Information Fusion, 2024 - Elsevier
Research on general multimodal systems has gained significant attention due to the
proliferation of multimodal data in the real world. Despite the remarkable performance …

Neural networks special issue on Artificial Intelligence and brain science

K Doya, K Friston, M Sugiyama, J Tenenbaum - 2022 - dl.acm.org
Neural Networks special issue on Artificial Intelligence and Brain Science | Neural Networks
skip to main content ACM Digital Library home ACM home Google, Inc. (search) Advanced …

Learning multi-modal generative models with permutation-invariant encoders and tighter variational bounds

M Hirt, D Campolo, V Leong, JP Ortega - arXiv preprint arXiv:2309.00380, 2023 - arxiv.org
Devising deep latent variable models for multi-modal data has been a long-standing theme
in machine learning research. Multi-modal Variational Autoencoders (VAEs) have been a …

MultiVae: A Python library for Multimodal Generative Autoencoders

A Senellart, C Chadebec, S Allassonnière - 2023 - hal.science
In recent years, there has been a major boom in the development of multimodal machine
learning models. Among open topics, representation (fusion) and generation of multimodal …

Messing With The Gap: On The Modality Gap Phenomenon In Multimodal Contrastive Representation Learning

M Al-Jaff - 2023 - diva-portal.org
In machine learning, a sub-field of computer science, a two-tower architecture model is a
specialised type of neural network model that encodes paired data from different modalities …