CMGAN: Conformer-based metric GAN for speech enhancement

R Cao, S Abdulatif, B Yang - arXiv preprint arXiv:2203.15149, 2022 - arxiv.org
Recently, convolution-augmented transformer (Conformer) has achieved promising
performance in automatic speech recognition (ASR) and time-domain speech enhancement …

Cmgan: Conformer-based metric-gan for monaural speech enhancement

S Abdulatif, R Cao, B Yang - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
In this work, we further develop the conformer-based metric generative adversarial network
(CMGAN) model 1 for speech enhancement (SE) in the time-frequency (TF) domain. This …

SCP-GAN: Self-correcting discriminator optimization for training consistency preserving metric GAN on speech enhancement tasks

V Zadorozhnyy, Q Ye, K Koishida - arXiv preprint arXiv:2210.14474, 2022 - arxiv.org
In recent years, Generative Adversarial Networks (GANs) have produced significantly
improved results in speech enhancement (SE) tasks. They are difficult to train, however. In …

Speech enhancement deep-learning architecture for efficient edge processing

M Pal, A Ramanathan, T Wada, A Pandey - arXiv preprint arXiv …, 2024 - arxiv.org
Deep learning has become a de facto method of choice for speech enhancement tasks with
significant improvements in speech quality. However, real-time processing with reduced size …

Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet

X Hao, C Ma, Q Yang, J Wu, KC Tan - arXiv preprint arXiv:2410.04785, 2024 - arxiv.org
Speech enhancement is critical for improving speech intelligibility and quality in various
audio devices. In recent years, deep learning-based methods have significantly improved …

PAMGAN+/-: Improving Phase-Aware Speech Enhancement Performance via Expanded Discriminator Training

G Close, T Hain, S Goetze - Audio Engineering Society Convention 154, 2023 - aes.org
Recent speech enhancement work, which makes use of neural networks trained with a loss
derived in part using an adversarial metric prediction network, has shown to be very …

MRGAN: LightWeight Monaural Speech Enhancement Using GAN Network

C Meng, G Wei, Y Long, C Kong, P Ma - Chinese Conference on Pattern …, 2024 - Springer
Abstract In recent years, Generative Adversarial Networks (GANs) have made significant
progress in the field of speech enhancement. However, due to the high training difficulty of …

Improving Speech Perceptual Quality and Intelligibility Through Sub-band Temporal Envelope Characteristics

R Wu, Z Huang, J Song, X Liang - National Conference on Man-Machine …, 2023 - Springer
In the speech enhancement (SE) model, using auxiliary loss based on acoustic parameters
can improve enhancement effects. However, currently used acoustic parameters focus on …

Heterogeneous Network Framework with Attention Mechanism of Speech Enhancement for Car Intelligent Cockpit Speech Recognition

YW Tan, XF Ding - 2023 26th Conference of the Oriental …, 2023 - ieeexplore.ieee.org
The success of deep learning has significantly benefited single-channel speech
enhancement in terms of intelligibility and perceptual quality. Traditional approaches have …

[PDF][PDF] Towards an Efficient and Accurate Speech Enhancement by a Comprehensive Ablation Study

LA Azcutia - 2024 - oa.upm.es
Speech enhancement tasks are methods that improve the quality and intelligibility of noisy
audio signals. To that end, speech enhancement models are trained to distinguish between …