Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving...

SH Doh, M Won, K Choi, J Nam - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

This paper introduces effective design choices for text-to-music retrieval systems. An ideal
text-based retrieval system would support various input queries such as pre-defined tags …

被引用次数：28 相关文章所有 4 个版本

[PDF] arxiv.org

Strada: A singer traits dataset

Y Kong, VA Tran, R Hennequin - arXiv preprint arXiv:2406.04140, 2024 - arxiv.org

There is a limited amount of large-scale public datasets that contain downloadable music
audio files and rich lead singer metadata. To provide such a dataset to benefit research in …

被引用次数：3 相关文章所有 4 个版本

[PDF] hal.science

Zero-note samba: Self-supervised beat tracking

D Desblancs, V Lostanlen… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org

Supervised machine learning for music information retrieval requires a large annotated
training set, and is thus an expensive and time-consuming process. To circumvent this …

被引用次数：12 相关文章所有 4 个版本

[PDF] arxiv.org

Textless speech-to-music retrieval using emotion similarity

SH Doh, M Won, K Choi, J Nam - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

We introduce a framework that recommends music based on the emotions of speech. In
content creation and daily life, speech contains information about human emotions, which …

被引用次数：3 相关文章所有 4 个版本

[PDF] arxiv.org

Multi-Source Contrastive Learning from Musical Audio

C Garoufis, A Zlatintsi, P Maragos - arXiv preprint arXiv:2302.07077, 2023 - arxiv.org

Contrastive learning constitutes an emerging branch of self-supervised learning that
leverages large amounts of unlabeled data, by learning a latent space, where pairs of …

被引用次数：11 相关文章所有 6 个版本

[PDF] arxiv.org

高级搜索

QQ 群