Related articles - Academic resource search

A regression approach to single-channel speech separation via high-resolution deep neural networks

J Du, Y Tu, LR Dai, CH Lee - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org

We propose a novel data-driven approach to single-channel speech separation based on
deep neural networks (DNNs) to directly model the highly nonlinear relationship between …

Cited by 101 · Related articles · All 6 versions

[PDF] arxiv.org

Spmamba: State-space model is all you need in speech separation

K Li, G Chen - arXiv preprint arXiv:2404.02063, 2024 - arxiv.org

In speech separation, both CNN- and Transformer-based models have demonstrated robust
separation capabilities, garnering significant attention within the research community …

Cited by 8 · Related articles · All 2 versions

[PDF] arxiv.org

Multi-channel speech separation using spatially selective deep non-linear filters

K Tesch, T Gerkmann - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org

In a multi-channel separation task with multiple speakers, we aim to recover all individual
speech signals from the mixture. In contrast to single-channel approaches, which rely on the …

Cited by 3 · Related articles · All 4 versions

[PDF] researchgate.net

Deep stacking networks with time series for speech separation

S Nie, H Zhang, XL Zhang… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org

In many present speech separation approaches, the separation task is formulated as a
binary classification problem. Several classification-based approaches have been proposed …

Cited by 33 · Related articles · All 5 versions

[PDF] arxiv.org

Deep attention fusion feature for speech separation with end-to-end post-filter method

C Fan, J Tao, B Liu, J Yi, Z Wen, X Liu - arXiv preprint arXiv:2003.07544, 2020 - arxiv.org

In this paper, we propose an end-to-end post-filter method with deep attention fusion
features for monaural speaker-independent speech separation. At first, a time-frequency …

Cited by 10 · Related articles · All 2 versions

[PDF] arxiv.org

Multi-dimensional and multi-scale modeling for speech separation optimized by discriminative learning

Z Mu, X Yang, W Zhu - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

Transformer has shown advanced performance in speech separation, benefiting from its
ability to capture global features. However, capturing local features and channel information …

Cited by 4 · Related articles · All 3 versions

[PDF] arxiv.org

On end-to-end multi-channel time domain speech separation in reverberant environments

J Zhang, C Zorilă, R Doddipatla… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

This paper introduces a new method for multi-channel time domain speech separation in
reverberant environments. A fully-convolutional neural network structure has been used to …

Cited by 45 · Related articles · All 5 versions

[PDF] arxiv.org

An efficient encoder-decoder architecture with top-down attention for speech separation

K Li, R Yang, X Hu - arXiv preprint arXiv:2209.15200, 2022 - arxiv.org

Deep neural networks have shown excellent prospects in speech separation tasks.
However, obtaining good results while keeping a low model complexity remains challenging …

Cited by 34 · Related articles · All 3 versions

[PDF] arxiv.org

TransMask: A compact and fast speech separation model based on transformer

Z Zhang, B He, Z Zhang - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Speech separation is an important problem in speech processing, which targets to separate
and generate clean speech from a mixed audio containing speech from different speakers …

Cited by 22 · Related articles · All 3 versions

[PDF] arxiv.org

Tiny-sepformer: A tiny time-domain transformer network for speech separation

J Luo, J Wang, N Cheng, E Xiao, X Zhang… - arXiv preprint arXiv …, 2022 - arxiv.org

Time-domain Transformer neural networks have proven their superiority in speech
separation tasks. However, these models usually have a large number of network …

Cited by 11 · Related articles · All 6 versions
