Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Multi-source diffusion models for simultaneous music generation and separation

G Mariani, I Tallini, E Postolache, M Mancusi… - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we define a diffusion-based generative model capable of both music synthesis
and source separation by learning the score of the joint probability density of sources …

Musical timbre style transfer with diffusion model

H Huang, J Man, L Li, R Zeng - PeerJ Computer Science, 2024 - peerj.com
In this work, we focus on solving the problem of timbre transfer in audio samples. The goal is
to transfer the source audio's timbre from one instrument to another while retaining as much …

Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement

Y Yang, Z Liu, W Yu, G Sun, Q Kong… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion-based generative models have recently achieved remarkable results in speech
and vocal enhancement due to their ability to model complex speech data distributions …

Improving Source Extraction with Diffusion and Consistency Models

T Karchkhadze, MR Izadi, S Zhang - arXiv preprint arXiv:2412.06965, 2024 - arxiv.org
In this work, we demonstrate the integration of a score-matching diffusion model into a
deterministic architecture for time-domain musical source extraction, resulting in enhanced …

Timbre transfer using image-to-image denoising diffusion models

L Comanducci, F Antonacci, A Sarti - arXiv preprint arXiv:2307.04586, 2023 - arxiv.org
Timbre transfer techniques aim at converting the sound of a musical piece generated by one
instrument into the same one as if it was played by another instrument, while maintaining as …

[PDF][PDF] Timbre transfer using image-to-image denoising diffusion implicit models

L Comanducci, F Antonacci, A Sarti - … 5-9, 2023 (ISBN: 978-1 …, 2024 - re.public.polimi.it
Timbre transfer techniques aim at converting the sound of a musical piece generated by one
instrument into the same one as if it was played by another instrument, while maintaining as …

Harnessing the capabilities of Generative Models

G Mariani - 2024 - tesidottorato.depositolegale.it
Generative models have experienced significant advancements in recent years, driven by
the introduction of architectures such as Stable Diffusion, GPT-3, ChatGPT, and many …

From source separation to compositional music generation

E Postolache - 2024 - iris.uniroma1.it
This thesis proposes a journey into sound processing through deep learning, particularly
generative models, exploring the compositional structure of sound, which is layered in …

Blind signal separation applications and methods

M Monastyrskyi - Загальнодержавний науково-виробничий та …, 2024 - eee.khpi.edu.ua
Blind signal separation is the task of separating the given mixture signal into two or more
corresponding sources. It finds an application in many fields of human activity such as …