As one of the most intuitive interfaces known to humans, natural language has the potential to mediate many tasks that involve human-computer interaction, especially in application …
The coronavirus has affected millions of lives and has joined the list of major pandemics, continuing to spread through its variants. Since it is transmitted through the …
To improve performance in various music information processing tasks, recent studies exploit different modalities that capture diverse aspects of music. Such modalities …
Creating novel interpretations of existing musical compositions is and has always been an essential part of musical practice. Before the advent of recorded music, listening to a piece of …
WC Payne, AY Xu, F Ahmed, L Ye, A Hurst - Proceedings of the 22nd …, 2020 - dl.acm.org
Today, music creation software and hardware are central to the workflow of most professional composers, producers, and songwriters. Music is an aural art form, but it is …
Cross-modal retrieval learns the relationship between the two types of data in a common space so that an input from one modality can retrieve data from a different modality. We …
The analysis of recorded audio material using computational methods has received increased attention in ethnomusicological research. We present a curated dataset of …
In the field of music information retrieval (MIR), cover song identification (CSI) is a challenging task that aims to identify cover versions of a query song from a massive …
D Zeng, J Wu, G Hattori, R Xu, Y Yu - ACM Transactions on Multimedia …, 2023 - dl.acm.org
Audio-visual tracks in video contain rich semantic information with potential for many applications and research areas. Since the audio-visual data have inconsistent distributions and …