What makes the difference? An empirical comparison of fusion strategies for multimodal language analysis

D Gkoumas, Q Li, C Lioma, Y Yu, D Song - Information Fusion, 2021 - Elsevier
Multimodal video sentiment analysis is a rapidly growing area. It combines verbal (i.e.,
linguistic) and non-verbal modalities (i.e., visual, acoustic) to predict the sentiment of …

Cross-modal retrieval for knowledge-based visual question answering

P Lerner, O Ferret, C Guinaudeau - European Conference on Information …, 2024 - Springer
Abstract Knowledge-based Visual Question Answering about Named Entities is a
challenging task that requires retrieving information from a multimodal Knowledge Base …

Background music recommendation based on latent factors and moods

CL Liu, YC Chen - Knowledge-Based Systems, 2018 - Elsevier
Many mobile devices are equipped with video shooting function, and users tend to use
these mobile devices to produce user generated content (UGC), and share with friends or …

A rule-based obfuscating focused crawler in the audio retrieval domain

M Montanaro, AM Rinaldi, C Russo… - Multimedia Tools and …, 2024 - Springer
The detection of violations of intellectual properties on multimedia files is a critical problem
for the current infrastructure of the Internet, especially within very large document collections …

Canonical cortical graph neural networks and its application for speech enhancement in audio-visual hearing aids

LA Passos, JP Papa, A Hussain, A Adeel - Neurocomputing, 2023 - Elsevier
Despite the recent success of machine learning algorithms, most models face drawbacks
when considering more complex tasks requiring interaction between different sources, such …

Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search

Y Yuan, C Siro, M Aliannejadi, M de Rijke… - Proceedings of the ACM …, 2024 - dl.acm.org
In mixed-initiative conversational search systems, clarifying questions aid users who
struggle to express their intentions in a single query. These questions aim to uncover user's …

Modeling uncertainty in bibliometrics and information retrieval: an information fusion approach

A Karlsson, B Hammarfelt, HJ Steinhauer, G Falkman… - Scientometrics, 2015 - Springer
We describe ongoing research where the aim is to apply recent results from the research
field of information fusion to bibliometric analysis and information retrieval. We highlight the …

Using knowledge graphs for audio retrieval: a case study on copyright infringement detection

M Montanaro, AM Rinaldi, C Russo, C Tommasino - World Wide Web, 2024 - Springer
Identifying cases of intellectual property violation in multimedia files poses significant
challenges for the Internet infrastructure, especially when dealing with extensive document …

MM-FOOD: a high-dimensional index structure for efficiently querying content and concept of multimedia data

S Arslan, A Yazici - Journal of Intelligent & Fuzzy Systems, 2023 - content.iospress.com
The semantic query problem is commonly called the semantic gap and is one of the
significant problems in multimedia data retrieval. In this study, we focus on multimedia data …

Multi-view learning review: understanding methods and their application

KI Bae, YS Lee, C Lim - The Korean Journal of Applied Statistics, 2019 - koreascience.kr
Multi-view learning considers data from various viewpoints as well as attempts to integrate
various information from data. Multi-view learning has been studied recently and has …