Studies in affective audio–visual correspondence learning require ground-truth data to train, validate, and test models. The number of available datasets together with benchmarks …
E Frid, C Gomes, Z Jin - Proceedings of the 2020 CHI conference on …, 2020 - dl.acm.org
Short online videos have become the dominating media on social platforms. However, finding suitable music to accompany videos can be a challenging task to some video …
Cross-modal retrieval learns the relationship between the two types of data in a common space so that an input from one modality can retrieve data from a different modality. We …
Modeling the association between music and emotion has been considered important for music information retrieval and affective human computer interaction. This paper presents a …
L Prétet, G Richard, C Souchier… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We study cross-modal recommendation of musictracks to be used as soundtracks for videos. This problem is known as the music supervision task. We build on a self-supervised system …
S Wang, C Xu, AS Ding, Z Tang - Electronics, 2021 - mdpi.com
Emotion-aware music recommendations has gained increasing attention in recent years, as music comes with the ability to regulate human emotions. Exploiting emotional information …
FF Kuo, MK Shan, SY Lee - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
Automatic video editing is receiving increasingly attention as the digital camera technology develops further and social media sites such as YouTube and Flickr become popular …
Detecting complex events in videos is intrinsically a multimodal problem since both audio and visual channels provide important clues. While conventional methods fuse both …
X Wu, Y Qiao, X Wang, X Tang - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Human perceptions of music and image are closely related to each other, since both can inspire similar human sensations, such as emotion, motion, and power. This paper aims to …