Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation

R Gu, Y Zou - arXiv preprint arXiv:2001.00391, 2020 - arxiv.org
Target speech separation refers to extracting the target speaker's speech from mixed
signals. Despite the recent advances in deep learning based close-talk speech separation …

Recurrent deep stacking networks for supervised speech separation

ZQ Wang, DL Wang - 2017 IEEE International Conference on …, 2017 - ieeexplore.ieee.org
Supervised speech separation algorithms seldom utilize output patterns. This study
proposes a novel recurrent deep stacking approach for time-frequency masking based …

Time-domain speech extraction with spatial information and multi speaker conditioning mechanism

J Zhang, C Zorilă, R Doddipatla… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
In this paper, we present a novel multi-channel speech extraction system to simultaneously
extract multiple clean individual sources from a mixture in noisy and reverberant …

A comprehensive study of speech separation: spectrogram vs waveform separation

F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu… - arXiv preprint arXiv …, 2019 - arxiv.org
Speech separation has been studied widely for single-channel close-talk microphone
recordings over the past few years; developed solutions are mostly in frequency-domain …

Multi-channel narrow-band deep speech separation with full-band permutation invariant training

C Quan, X Li - … 2022-2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org
This paper addresses the problem of multi-channel multi-speech separation based on deep
learning techniques. In the short time Fourier transform domain, we propose an end-to-end …

Effective low-cost time-domain audio separation using globally attentive locally recurrent networks

MWY Lam, J Wang, D Su, D Yu - 2021 IEEE Spoken Language …, 2021 - ieeexplore.ieee.org
Recent research on the time-domain audio separation networks (TasNets) has brought great
success to speech separation. Nevertheless, conventional TasNets struggle to satisfy the …

[PDF][PDF] Multi-resolution stacking for speech separation based on boosted DNN

XL Zhang, DL Wang - Sixteenth Annual Conference of the …, 2015 - xiaolei-zhang.net
Recent progress in speech separation shows that deep neural networks (DNN) based
supervised methods can improve the performance in difficult noise conditions and exhibit …

Unsupervised training for deep speech source separation with Kullback-Leibler divergence based probabilistic loss function

M Togami, Y Masuyama, T Komatsu… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
In this paper, we propose a multi-channel speech source separation method with a deep
neural network (DNN) which is trained under the condition that no clean signal is available …

Effective Monoaural Speech Separation through Convolutional Top-Down Multi-View Network

AN Aung, CW Liao, JW Hung - Future Internet, 2024 - mdpi.com
Speech separation, sometimes known as the “cocktail party problem”, is the process of
separating individual speech signals from an audio mixture that includes ambient noises …

Speaker-independent speech separation with deep attractor network

Y Luo, Z Chen, N Mesgarani - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
Despite the recent success of deep learning for many speech processing tasks, single-
microphone, speaker-independent speech separation remains challenging for two main …