This textbook provides both profound technological knowledge and a comprehensive treatment of essential topics in music processing and music information retrieval. Including …
N Sturmel, L Daudet - International conference on digital audio effects …, 2011 - dafx.de
This paper presents a review on techniques for signal reconstruction without phase, ie when only the spectrogram (the squared magnitude of the Short Time Fourier Transform) of the …
K Han, Y Wang, DL Wang, WS Woods… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org
In real-world environments, human speech is usually distorted by both reverberation and background noise, which have negative effects on speech intelligibility and speech quality …
There has been a recent surge in adversarial attacks on deep learning based automatic speech recognition (ASR) systems. These attacks pose new challenges to deep learning …
N Perraudin, P Balazs… - 2013 IEEE workshop on …, 2013 - ieeexplore.ieee.org
In this paper, we present a new algorithm to estimate a signal from its short-time Fourier transform modulus (STFTM). This algorithm is computationally simple and is obtained by an …
In recent years, deep networks have led to dramatic improvements in speech enhancement by framing it as a data-driven pattern recognition problem. In many modern enhancement …
Z Průša, P Balazs… - IEEE/ACM Transactions on …, 2017 - ieeexplore.ieee.org
A noniterative method for the reconstruction of the short-time fourier transform (STFT) phase from the magnitude is presented. The method is based on the direct relationship between …
While recent neural sequence-to-sequence models have greatly improved the quality of speech synthesis, there has not been a system capable of fast training, fast inference and …
Human speech in real-world environments is typically degraded by the background noise. They have a negative impact on perceptual speech quality and intelligibility which causes …