Dose: Diffusion dropout with adaptive prior for speech enhancement

W Tai, Y Lei, F Zhou, G Trajcevski… - Advances in Neural …, 2024 - proceedings.neurips.cc
Speech enhancement (SE) aims to improve the intelligibility and quality of speech in the
presence of non-stationary additive noise. Deterministic deep learning models have …

Revisiting denoising diffusion probabilistic models for speech enhancement: Condition collapse, efficiency and refinement

W Tai, F Zhou, G Trajcevski, T Zhong - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Recent literature has shown that denoising diffusion probabilistic models (DDPMs) can be
used to synthesize high-fidelity samples with a competitive (or sometimes better) quality than …

Diffusion-based Speech Enhancement with Schr\" odinger Bridge and Symmetric Noise Schedule

S Wang, S Liu, A Harper, P Kendrick… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, diffusion-based generative models have demonstrated remarkable performance in
speech enhancement tasks. However, these methods still encounter challenges, including …

Cross-domain diffusion based speech enhancement for very noisy speech

H Wang, DL Wang - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Deep learning based speech enhancement has achieved remarkable success, but
challenges remain in low signal-to-noise ratio (SNR) nonstationary noise scenarios. In this …

Nadiffuse: Noise-aware diffusion-based model for speech enhancement

W Wang, D Yang, Q Ye, B Cao… - 2023 Asia Pacific Signal …, 2023 - ieeexplore.ieee.org
The goal of speech enhancement (SE) is to eliminate the background interference from the
noisy speech signal. Generative models such as diffusion models (DM) have been applied …

URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement

W Zhang, R Scheibler, K Saijo, S Cornell, C Li… - arXiv preprint arXiv …, 2024 - arxiv.org
The last decade has witnessed significant advancements in deep learning-based speech
enhancement (SE). However, most existing SE research has limitations on the coverage of …

Conditional diffusion probabilistic model for speech enhancement

YJ Lu, ZQ Wang, S Watanabe… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Speech enhancement is a critical component of many user-oriented audio applications, yet
current systems still suffer from distorted and unnatural outputs. While generative models …

[PDF][PDF] A versatile diffusion-based generative refiner for speech enhancement

R Sawata, N Murata, Y Takida, T Uesaka… - arXiv preprint arXiv …, 2022 - academia.edu
Although deep neural network (DNN)-based speech enhancement (SE) methods outperform
the previous non-DNN-based ones, they often degrade the perceptual quality of generated …

Dynamic noise embedding: Noise aware training and adaptation for speech enhancement

J Lee, Y Jung, M Jung, H Kim - 2020 Asia-Pacific Signal and …, 2020 - ieeexplore.ieee.org
Estimating noise information exactly is crucial for noise aware training in speech
applications including speech enhancement (SE) which is our focus in this paper. To …

Toward universal speech enhancement for diverse input conditions

W Zhang, K Saijo, ZQ Wang… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
The past decade has witnessed substantial growth of data-driven speech enhancement (SE)
techniques thanks to deep learning. While existing approaches have shown impressive …