Pre-training Feature Guided Diffusion Model for Speech Enhancement

Y Yang, N Trigoni, A Markham - arXiv preprint arXiv:2406.07646, 2024 - arxiv.org
Speech enhancement significantly improves the clarity and intelligibility of speech in noisy
environments, improving communication and listening experiences. In this paper, we …

Diffusion-based speech enhancement with a weighted generative-supervised learning loss

JE Ayilo, M Sadeghi, R Serizel - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Diffusion-based generative models have recently gained attention in speech enhancement
(SE), providing an alternative to conventional supervised methods. These models transform …

Dose: Diffusion dropout with adaptive prior for speech enhancement

W Tai, Y Lei, F Zhou, G Trajcevski… - Advances in Neural …, 2024 - proceedings.neurips.cc
Speech enhancement (SE) aims to improve the intelligibility and quality of speech in the
presence of non-stationary additive noise. Deterministic deep learning models have …

Unsupervised speech enhancement with diffusion-based generative models

B Nortier, M Sadeghi, R Serizel - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Recently, conditional score-based diffusion models have gained significant attention in the
field of supervised speech enhancement, yielding state-of-the-art performance. However …

Cross-domain diffusion based speech enhancement for very noisy speech

H Wang, DL Wang - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Deep learning based speech enhancement has achieved remarkable success, but
challenges remain in low signal-to-noise ratio (SNR) nonstationary noise scenarios. In this …

Coarse-to-fine optimization for speech enhancement

J Yao, A Al-Dahle - arXiv preprint arXiv:1908.08044, 2019 - arxiv.org
In this paper, we propose the coarse-to-fine optimization for the task of speech
enhancement. Cosine similarity loss [1] has proven to be an effective metric to measure …

Noise-aware speech enhancement using diffusion probabilistic model

Y Hu, C Chen, R Li, Q Zhu, ES Chng - arXiv preprint arXiv:2307.08029, 2023 - arxiv.org
With recent advances of diffusion model, generative speech enhancement (SE) has
attracted a surge of research interest due to its great potential for unseen testing noises …

Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement

R Sawata, N Murata, Y Takida, T Uesaka… - arXiv preprint arXiv …, 2022 - arxiv.org
Although deep neural network (DNN)-based speech enhancement (SE) methods outperform
the previous non-DNN-based ones, they often degrade the perceptual quality of generated …

Se-bridge: Speech enhancement with consistent brownian bridge

Z Qiu, M Fu, F Sun, G Altenbek, H Huang - arXiv preprint arXiv:2305.13796, 2023 - arxiv.org
We propose SE-Bridge, a novel method for speech enhancement (SE). After recently
applying the diffusion models to speech enhancement, we can achieve speech …

Conditional diffusion probabilistic model for speech enhancement

YJ Lu, ZQ Wang, S Watanabe… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Speech enhancement is a critical component of many user-oriented audio applications, yet
current systems still suffer from distorted and unnatural outputs. While generative models …