TF-GridNet: Integrating full-and sub-band modeling for speech separation

ZQ Wang, S Cornell, S Choi, Y Lee… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
We propose TF-GridNet for speech separation. The model is a novel deep neural network
(DNN) integrating full-and sub-band modeling in the time-frequency (TF) domain. It stacks …

SpatialNet: Extensively learning spatial information for multichannel joint speech separation, denoising and dereverberation

C Quan, X Li - IEEE/ACM Transactions on Audio, Speech, and …, 2024 - ieeexplore.ieee.org
This work proposes a neural network to extensively exploit spatial information for
multichannel joint speech separation, denoising and dereverberation, named SpatialNet. In …

Towards unified all-neural beamforming for time and frequency domain speech separation

R Gu, SX Zhang, Y Zou, D Yu - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Recently, frequency domain all-neural beamforming methods have achieved remarkable
progress for multichannel speech separation. In parallel, the integration of time domain …

Dasformer: Deep alternating spectrogram transformer for multi/single-channel speech separation

S Wang, X Kong, X Peng, H Movassagh… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
For the task of speech separation, previous study usually treats multi-channel and single-
channel scenarios as two research tracks with specialized solutions developed respectively …

[PDF][PDF] Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation

Y Xu, V Kothapally, M Yu, S Zhang, D Yu - Proc. INTERSPEECH, 2023 - isca-archive.org
Despite the recent success of all-neural beamforming approaches for speech separation,
deploying them onto lowpowered devices is difficult due to their demanding computational …

Deep neural mel-subband beamformer for in-car speech separation

V Kothapally, Y Xu, M Yu, SX Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
While current deep learning (DL)-based beamforming techniques have been proved
effective in speech separation, they are often designed to process narrow-band (NB) …

Multi-channel speech separation using spatially selective deep non-linear filters

K Tesch, T Gerkmann - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
In a multi-channel separation task with multiple speakers, we aim to recover all individual
speech signals from the mixture. In contrast to single-channel approaches, which rely on the …

NBC2: Multichannel speech separation with revised narrow-band conformer

C Quan, X Li - arXiv preprint arXiv:2212.02076, 2022 - arxiv.org
This work proposes a multichannel narrow-band speech separation network. In the short-
time Fourier transform (STFT) domain, the proposed network processes each frequency …

Non-linear Spatial Filtering for Multi-channel Speech Enhancement and Separation

K Tesch - 2023 - ediss.sub.uni-hamburg.de
A large part of human speech communication takes place in noisy environments and is
supported by technical devices. For example, a hearing-impaired person might use a …

[PDF][PDF] Time-frequency Domain Filter-and-sum Network for Multi-channel Speech Separation

Z Deng, Y Zhou, H Liu - isca-archive.org
Learning-based methods have made impressive strides in speech separation, and the
implicit filter-and-sum network (iFaSNet) stands out as a reliable multi-channel solution …