Leveraging hierarchy in multimodal generative models for effective cross-modality inference

P Poklukar, M Vasco, H Yin, FS Melo… - International …, 2022 - proceedings.mlr.press

Learning representations of multimodal data that are both informative and robust to missing
modalities at test time remains a challenging problem due to the inherent heterogeneity of …

被引用次数：47 相关文章所有 5 个版本

COM: Contrastive Masked-attention model for incomplete multimodal learning

S Qian, C Wang - Neural Networks, 2023 - Elsevier

Most multimodal learning methods assume that all modalities are always available in data.
However, in real-world applications, the assumption is often violated due to privacy …

被引用次数：11 相关文章所有 4 个版本

[PDF] ethz.ch

MMVAE+: Enhancing the generative quality of multimodal VAEs without compromises

E Palumbo, I Daunhawer… - The Eleventh …, 2023 - research-collection.ethz.ch

Multimodal VAEs have recently gained attention as efficient models for weakly-supervised
generative learning with multiple modalities. However, all existing variants of multimodal …

被引用次数：22 相关文章所有 3 个版本

[PDF] mit.edu

MissModal: Increasing Robustness to Missing Modality in Multimodal Sentiment Analysis

R Lin, H Hu - Transactions of the Association for Computational …, 2023 - direct.mit.edu

When applying multimodal machine learning in downstream inference, both joint and
coordinated multimodal representations rely on the complete presence of modalities as in …

被引用次数：8 相关文章所有 5 个版本

[PDF] mdpi.com

Multi-modal latent diffusion

M Bounoua, G Franzese, P Michiardi - Entropy, 2024 - mdpi.com

Multimodal datasets are ubiquitous in modern applications, and multimodal Variational
Autoencoders are a popular family of models that aim to learn a joint representation of …

被引用次数：11 相关文章所有 9 个版本

Adapt and explore: Multimodal mixup for representation learning

R Lin, H Hu - Information Fusion, 2024 - Elsevier

Research on general multimodal systems has gained significant attention due to the
proliferation of multimodal data in the real world. Despite the remarkable performance …

被引用次数：6 相关文章所有 3 个版本

Neural networks special issue on Artificial Intelligence and brain science

K Doya, K Friston, M Sugiyama, J Tenenbaum - 2022 - dl.acm.org

Neural Networks special issue on Artificial Intelligence and Brain Science | Neural Networks
skip to main content ACM Digital Library home ACM home Google, Inc. (search) Advanced …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Learning multi-modal generative models with permutation-invariant encoders and tighter variational bounds

M Hirt, D Campolo, V Leong, JP Ortega - arXiv preprint arXiv:2309.00380, 2023 - arxiv.org

Devising deep latent variable models for multi-modal data has been a long-standing theme
in machine learning research. Multi-modal Variational Autoencoders (VAEs) have been a …

MultiVae: A Python library for Multimodal Generative Autoencoders

A Senellart, C Chadebec, S Allassonnière - 2023 - hal.science

In recent years, there has been a major boom in the development of multimodal machine
learning models. Among open topics, representation (fusion) and generation of multimodal …

被引用次数：1 相关文章所有 5 个版本

[PDF] diva-portal.org

Messing With The Gap: On The Modality Gap Phenomenon In Multimodal Contrastive Representation Learning

M Al-Jaff - 2023 - diva-portal.org

In machine learning, a sub-field of computer science, a two-tower architecture model is a
specialised type of neural network model that encodes paired data from different modalities …

被引用次数：1 相关文章

高级搜索

QQ 群