[PDF][PDF] Monaural speech separation method based on recurrent attention with parallel branches

X Yang, C Bao, X Zhang, X Chen - Proc. Interspeech, 2023 - drive.google.com
In many speech separation methods, the contextual information contained in the feature
sequence is mainly modeled by recurrent layer and/or self-attention mechanism. However …

Single-channel speech extraction using speaker inventory and attention network

X Xiao, Z Chen, T Yoshioka, H Erdogan… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
Neural network-based speech separation has received a surge of interest in recent years.
Previously proposed methods either are speaker independent or extract a target speaker's …

Librimix: An open-source dataset for generalizable speech separation

J Cosentino, M Pariente, S Cornell, A Deleforge… - arXiv preprint arXiv …, 2020 - arxiv.org
In recent years, wsj0-2mix has become the reference dataset for single-channel speech
separation. Most deep learning-based speech separation models today are benchmarked …

Monaural speech separation using dual-output deep neural network with multiple joint constraint

S Linhui, L Wenqing, Z Meng… - Chinese Journal of …, 2023 - ieeexplore.ieee.org
Monaural speech separation is a significant research field in speech signal processing. To
achieve a better separation performance, we propose three novel joint-constraint loss …

Real-time multichannel speech separation and enhancement using a beamspace-domain-based lightweight CNN

M Olivieri, L Comanducci, M Pezzoli… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
The problems of speech separation and enhancement concern the extraction of the speech
emitted by a target speaker when placed in a scenario where multiple interfering speakers …

FurcaNeXt: End-to-end monaural speech separation with dynamic gated dilated temporal convolutional networks

L Zhang, Z Shi, J Han, A Shi, D Ma - … , Daejeon, South Korea, January 5–8 …, 2020 - Springer
Deep dilated temporal convolutional networks (TCN) have been proved to be very effective
in sequence modeling. In this paper we propose several improvements of TCN for end-to …

Graph convolution-based deep clustering for speech separation

S Qin, T Jiang, S Wu, N Wang, X Zhao - IEEE Access, 2020 - ieeexplore.ieee.org
Deep clustering is a promising technique for speech separation that is crucial to speech
communication, acoustic target detection, acoustic enhancement and speech recognition. In …

Source-aware context network for single-channel multi-speaker speech separation

ZX Li, Y Song, LR Dai… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Deep learning based approaches have achieved promising performance in speaker-
dependent single-channel multispeaker speech separation. However, partly due to the label …

Time-domain speaker extraction network

C Xu, W Rao, ES Chng, H Li - 2019 IEEE Automatic Speech …, 2019 - ieeexplore.ieee.org
Speaker extraction is to extract a target speaker's voice from multi-talker speech. It simulates
humans' cocktail party effect or the selective listening ability. The prior work mostly performs …

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Z Pan, G Wichern, Y Masuyama… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Target speech extraction aims to extract, based on a given conditioning cue, a target speech
signal that is corrupted by interfering sources, such as noise or competing speakers …