A review of differentiable digital signal processing for music and speech synthesis

B Hayes, J Shier, G Fazekas, A McPherson… - Frontiers in Signal …, 2024 - frontiersin.org
The term “differentiable digital signal processing” describes a family of techniques in which
loss function gradients are backpropagated through digital signal processors, facilitating …

[Book] Handbook of artificial intelligence for music

ER Miranda - 2021 - Springer
I am delighted to be in a position to write this preface: it is the last task that I need to get done
before I submit the manuscript for production. I have just gone through the checklist. All good …

Score-informed source separation of choral music

M Gover - 2020 - escholarship.mcgill.ca
Audio source separation consists of extracting one or more sources of significant interest
from a recording that contains multiple sound sources. These …

Applications of deep learning to audio generation

Y Zhao, X Xia, R Togneri - IEEE Circuits and Systems …, 2019 - ieeexplore.ieee.org
In the recent past years, deep learning based machine learning systems have demonstrated
remarkable success for a wide range of learning tasks in multiple domains such as computer …

Shimon sings - robotic musicianship finds its voice

R Savery, L Zahray, G Weinberg - Handbook of Artificial Intelligence for …, 2021 - Springer
Robotic Musicianship research at Georgia Tech Center for Music Technology
(GTCMT) focuses on the construction of autonomous and wearable robots that analyze …

Learning meshes for dense visual SLAM

M Bloesch, T Laidlow, R Clark… - Proceedings of the …, 2019 - openaccess.thecvf.com
Estimating motion and surrounding geometry of a moving camera remains a challenging
inference problem. From an information theoretic point of view, estimates should get better …

Singing synthesis: with a little help from my attention

O Angelini, A Moinet, K Yanagisawa… - arXiv preprint arXiv …, 2019 - arxiv.org
We present UTACO, a singing synthesis model based on an attention-based sequence-to-
sequence mechanism and a vocoder based on dilated causal convolutions. These two …

Transparency in music-generative AI: A systematic literature review

R Batlle-Roca, E Gómez, WH Liao, X Serra, Y Mitsufuji - 2023 - researchsquare.com
Music-generative AI raises multiple challenges particularly related to the work of artists, the
existing music industry model, the role of AI in creative processes, and the discussion of …

Deep autotuner: A pitch correcting network for singing performances

S Wager, G Tzanetakis, C Wang… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
We introduce a data-driven approach to automatic pitch correction of solo singing
performances. The proposed approach predicts note-wise pitch shifts from the relationship …

Content based singing voice source separation via strong conditioning using aligned phonemes

G Meseguer-Brocal, G Peeters - arXiv preprint arXiv:2008.02070, 2020 - arxiv.org
Informed source separation has recently gained renewed interest with the introduction of
neural networks and the availability of large multitrack datasets containing both the mixture …