R Zhang, F Nie, X Li, X Wei - Information Fusion, 2019 - Elsevier
This survey aims at providing a state-of-the-art overview of feature selection and fusion strategies, which select and combine multi-view features effectively to accomplish …
Lipreading is the task of decoding text from the movement of a speaker's mouth. Traditional approaches separated the problem into two stages: designing or learning visual features …
Deep networks have been successfully applied to unsupervised feature learning for single modalities (e.g., text, images or audio). In this work, we propose a novel application of deep …
Multimodal data fusion (MMDF) is the process of combining disparate data streams (of different dimensionality, resolution, type, etc.) to generate information in a form that is more …
This survey aims at providing multimedia researchers with a state-of-the-art overview of fusion strategies, which are used for combining multiple modalities in order to accomplish …
This work presents a scalable solution to open-vocabulary visual speech recognition. To achieve this, we constructed the largest existing visual speech recognition dataset …
In this paper, we present methods in deep multimodal learning for fusing speech and visual modalities for Audio-Visual Automatic Speech Recognition (AV-ASR). First, we study an …
ES Salama, RA El-Khoribi, ME Shoman… - Egyptian Informatics …, 2021 - Elsevier
Nowadays, human emotion recognition is a mandatory task for many human machine interaction fields. This paper proposes a novel multi-modal human emotion recognition …