Icassp 2023 deep noise suppression challenge

H Dubey, A Aazami, V Gopal, B Naderi… - IEEE Open Journal …, 2024 - ieeexplore.ieee.org
The ICASSP 2023 Deep Noise Suppression (DNS) Challenge marks the fifth edition of the
DNS challenge series. DNS challenges were organized from 2019 to 2023 to foster …

Speecht5: Unified-modal encoder-decoder pre-training for spoken language processing

J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu… - arXiv preprint arXiv …, 2021 - arxiv.org
Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural
language processing models, we propose a unified-modal SpeechT5 framework that …

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …

Interspeech 2021 deep noise suppression challenge

CKA Reddy, H Dubey, K Koishida, A Nair… - arXiv preprint arXiv …, 2021 - arxiv.org
The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area
of noise suppression to achieve superior perceptual speech quality. We recently organized …

DPT-FSNet: Dual-path transformer based full-band and sub-band fusion network for speech enhancement

F Dang, H Chen, P Zhang - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Sub-band models have achieved promising results due to their ability to model local
patterns in the spectrogram. Some studies further improve the performance by fusing sub …

[PDF][PDF] SE-Conformer: Time-Domain Speech Enhancement Using Conformer.

E Kim, H Seo - Interspeech, 2021 - isca-archive.org
Convolution-augmented transformer (conformer) has recently shown competitive results in
speech-domain applications, such as automatic speech recognition, continuous speech …

Towards efficient models for real-time deep noise suppression

S Braun, H Gamper, CKA Reddy… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
With recent research advancements, deep learning models are be-coming attractive and
powerful choices for speech enhancement in real-time applications. While state-of-the-art …

Noise robust automatic speech recognition: review and analysis

M Dua, Akanksha, S Dua - International Journal of Speech Technology, 2023 - Springer
Abstract Automatic Speech Recognition (ASR) system is an emerging technology used in
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …

(Don't) try this at home! The effects of recording devices and software on phonetic analysis: Supplementary material

C Sanker, S Babinski, R Burns, M Evans, J Johns… - Language, 2021 - muse.jhu.edu
2. RESULTS 2.1. EFFECTS OF DEVICE. Here we present the full results for the effect of
each device on the acoustic measurements. All statistical results are from mixed effects …

ICASSP 2021 deep noise suppression challenge: Decoupling magnitude and phase optimization with a two-stage deep network

A Li, W Liu, X Luo, C Zheng, X Li - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
It remains a tough challenge to recover the speech signals contaminated by various noises
under real acoustic environments. To this end, we propose a novel system for denoising in …