A diffusion-inspired training strategy for singing voice extraction in the waveform domain

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Cited by 12 · Related articles · All 3 versions

Multi-source diffusion models for simultaneous music generation and separation

G Mariani, I Tallini, E Postolache, M Mancusi… - arXiv preprint arXiv …, 2023 - arxiv.org

In this work, we define a diffusion-based generative model capable of both music synthesis
and source separation by learning the score of the joint probability density of sources …

Cited by 37 · Related articles · All 3 versions

Musical timbre style transfer with diffusion model

H Huang, J Man, L Li, R Zeng - PeerJ Computer Science, 2024 - peerj.com

In this work, we focus on solving the problem of timbre transfer in audio samples. The goal is
to transfer the source audio's timbre from one instrument to another while retaining as much …

Cited by 2 · Related articles · All 3 versions

Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement

Y Yang, Z Liu, W Yu, G Sun, Q Kong… - arXiv preprint arXiv …, 2024 - arxiv.org

Diffusion-based generative models have recently achieved remarkable results in speech
and vocal enhancement due to their ability to model complex speech data distributions …

Cited by 1 · Related articles

Improving Source Extraction with Diffusion and Consistency Models

T Karchkhadze, MR Izadi, S Zhang - arXiv preprint arXiv:2412.06965, 2024 - arxiv.org

In this work, we demonstrate the integration of a score-matching diffusion model into a
deterministic architecture for time-domain musical source extraction, resulting in enhanced …

Timbre transfer using image-to-image denoising diffusion models

L Comanducci, F Antonacci, A Sarti - arXiv preprint arXiv:2307.04586, 2023 - arxiv.org

Timbre transfer techniques aim at converting the sound of a musical piece generated by one
instrument into the same one as if it was played by another instrument, while maintaining as …

Cited by 1 · Related articles · All 2 versions

[PDF] Timbre transfer using image-to-image denoising diffusion implicit models

L Comanducci, F Antonacci, A Sarti - … 5-9, 2023 (ISBN: 978-1 …, 2024 - re.public.polimi.it

Timbre transfer techniques aim at converting the sound of a musical piece generated by one
instrument into the same one as if it was played by another instrument, while maintaining as …

Cited by 4 · Related articles · All 6 versions

Harnessing the capabilities of Generative Models

G Mariani - 2024 - tesidottorato.depositolegale.it

Generative models have experienced significant advancements in recent years, driven by
the introduction of architectures such as Stable Diffusion, GPT-3, ChatGPT, and many …

From source separation to compositional music generation

E Postolache - 2024 - iris.uniroma1.it

This thesis proposes a journey into sound processing through deep learning, particularly
generative models, exploring the compositional structure of sound, which is layered in …

Blind signal separation applications and methods

M Monastyrskyi - Загальнодержавний науково-виробничий та …, 2024 - eee.khpi.edu.ua

Blind signal separation is the task of separating the given mixture signal into two or more
corresponding sources. It finds an application in many fields of human activity such as …
