Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation

R Gu, Y Zou - arXiv preprint arXiv:2001.00391, 2020 - arxiv.org
Target speech separation refers to extracting the target speaker's speech from mixed
signals. Despite the recent advances in deep learning based close-talk speech separation …

An online speaker-aware speech separation approach based on time-domain representation

H Wang, Y Song, ZX Li, I McLoughlin… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Despite the significant progress of deep learning based speech separation methods, it
remains challenging to extract and track the speech from target speakers, especially in a …

Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information

R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su… - Interspeech, 2019 - isca-archive.org
The recent exploration of deep learning for supervised speech separation has significantly
accelerated the progress on the multi-talker speech separation problem. The multi-channel …

Speaker-independent speech separation with deep attractor network

Y Luo, Z Chen, N Mesgarani - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
Despite the recent success of deep learning for many speech processing tasks, single-
microphone, speaker-independent speech separation remains challenging for two main …

Integrating Spectral and Spatial Features for Multi-Channel Speaker Separation

ZQ Wang, DL Wang - Interspeech, 2018 - isca-archive.org
This paper tightly integrates spectral and spatial information for deep learning based multi-
channel speaker separation. The key idea is to localize individual speakers so that an …

On end-to-end multi-channel time domain speech separation in reverberant environments

J Zhang, C Zorilă, R Doddipatla… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
This paper introduces a new method for multi-channel time domain speech separation in
reverberant environments. A fully-convolutional neural network structure has been used to …

Multi-channel speech separation using spatially selective deep non-linear filters

K Tesch, T Gerkmann - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
In a multi-channel separation task with multiple speakers, we aim to recover all individual
speech signals from the mixture. In contrast to single-channel approaches, which rely on the …

End-to-end networks for supervised single-channel speech separation

S Venkataramani, P Smaragdis - arXiv preprint arXiv:1810.02568, 2018 - arxiv.org
The performance of single channel source separation algorithms has improved greatly in
recent times with the development and deployment of neural networks. However, many such …

Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments

J Neri, S Braun - … 2023-2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org
Real-time single-channel speech separation aims to unmix an audio stream captured from a
single microphone that contains multiple people talking at once, environmental noise, and …

DPCCN: Densely-connected pyramid complex convolutional network for robust speech separation and extraction

J Han, Y Long, L Burget… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
In recent years, a number of time-domain speech separation methods have been proposed.
However, most of them are very sensitive to the environments and wide domain coverage …