Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Multimodal pretraining, adaptation, and generation for recommendation: A survey

Q Liu, J Zhu, Y Yang, Q Dai, Z Du, XM Wu… - Proceedings of the 30th …, 2024 - dl.acm.org
Personalized recommendation serves as a ubiquitous channel for users to discover
information tailored to their interests. However, traditional recommendation models primarily …

Multitrack music transformer

HW Dong, K Chen, S Dubnov… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Existing approaches for generating multitrack music with transformer models have been
limited in terms of the number of instruments, the length of the music segments and slow …

CLaMP: Contrastive language-music pre-training for cross-modal symbolic music information retrieval

S Wu, D Yu, X Tan, M Sun - arXiv preprint arXiv:2304.11029, 2023 - arxiv.org
We introduce CLaMP: Contrastive Language-Music Pre-training, which learns cross-modal
representations between natural language and symbolic music using a music encoder and …

The music maestro or the musically challenged, a massive music evaluation benchmark for large language models

J Li, L Yang, M Tang, C Chen, Z Li, P Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Benchmarks play a pivotal role in assessing the advancements of large language models
(LLMs). While numerous benchmarks have been proposed to evaluate LLMs' capabilities …

GTR-CTRL: instrument and genre conditioning for guitar-focused music generation with transformers

P Sarmento, A Kumar, YH Chen, CJ Carr… - … Intelligence in Music …, 2023 - Springer
Recently, symbolic music generation with deep learning techniques has witnessed steady
improvements. Most works on this topic focus on MIDI representations, but less attention has …

MRBERT: Pre-training of melody and rhythm for automatic music generation

S Li, Y Sung - Mathematics, 2023 - mdpi.com
Deep learning technology has been extensively studied for its potential in music, notably for
creative music generation research. Traditional music generation approaches based on …

Fine-grained position helps memorizing more, a novel music compound transformer model with feature interaction fusion

Z Li, R Gong, Y Chen, K Su - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
Due to the particularity of the simultaneous occurrence of multiple events in music
sequences, compound Transformer is proposed to deal with the challenge of long …

MMD-MII model: a multilayered analysis and multimodal integration interaction approach revolutionizing music emotion classification

J Wang, A Sharifi, TR Gadekallu, A Shankar - International Journal of …, 2024 - Springer
Music plays a vital role in human culture and society, serving as a universal form of
expression. However, accurately classifying music emotions remains challenging due to the …

Composer classification using melodic combinatorial n-grams

DAP Alvarez, A Gelbukh, G Sidorov - Expert Systems with Applications, 2024 - Elsevier
In the present study, we investigate the supervised problem of composer classification. From
a set of compositions and a set of composers, we seek to assign each composition to the …