DPCCN: Densely-connected pyramid complex convolutional network for robust speech separation and extraction

J Han, Y Long, L Burget… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
In recent years, a number of time-domain speech separation methods have been proposed.
However, most of them are very sensitive to the environments and wide domain coverage …

Selinet: a lightweight model for single channel speech separation

HM Tan, DQ Vu, JC Wang - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
The time-domain speech separation methods adopting deep learning have obtained
impressive performance. However, the computational complexity, model size, and …

Conv-tasnet: Surpassing ideal time–frequency magnitude masking for speech separation

Y Luo, N Mesgarani - IEEE/ACM transactions on audio, speech …, 2019 - ieeexplore.ieee.org
Single-channel, speaker-independent speech separation methods have recently seen great
progress. However, the accuracy, latency, and computational cost of such methods remain …

TFCnet: time-frequency domain corrector for speech separation

W Tong, J Zhu, J Chen, Z Wu, S Kang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Deep learning-based methods have made significant achievements in speech separation.
Especially the time-domain separation methods have achieved the best performance in …

Improved speech separation with time-and-frequency cross-domain joint embedding and clustering

GP Yang, CI Tuan, HY Lee, L Lee - arXiv preprint arXiv:1904.07845, 2019 - arxiv.org
Speech separation has been very successful with deep learning techniques. Substantial
effort has been reported based on approaches over spectrogram, which is well known as the …

Multichannel speech separation with narrow-band conformer

C Quan, X Li - arXiv preprint arXiv:2204.04464, 2022 - arxiv.org
This work proposes a multichannel speech separation method with narrow-band Conformer
(named NBC). The network is trained to learn to automatically exploit narrow-band speech …

[PDF][PDF] Multi-scale group transformer for long sequence modeling in speech separation

Y Zhao, C Luo, ZJ Zha, W Zeng - Proceedings of the Twenty-Ninth …, 2021 - ijcai.org
In this paper, we introduce Transformer to the timedomain methods for single-channel
speech separation. Transformer has the potential to boost speech separation performance …

[PDF][PDF] End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain.

K Wang, H Huang, Y Hu, Z Huang, S Li - Interspeech, 2021 - isca-archive.org
Traditional single channel speech separation in the timefrequency (TF) domain often faces
the problem of phase reconstruction. Due to the fact that the real-valued network is not …

TFPSNet: Time-frequency domain path scanning network for speech separation

L Yang, W Liu, W Wang - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Speech separation has been very successful with deep learning techniques. In this paper,
we propose time-frequency (TF) domain path scanning network (TFPSNet) for speech …

Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation

MWY Lam, J Wang, D Su, D Yu - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
One of the leading single-channel speech separation (SS) models is based on a TasNet
with a dual-path segmentation technique, where the size of each segment remains …