相关文章- 学术资源搜索

Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation

R Gu, Y Zou - arXiv preprint arXiv:2001.00391, 2020 - arxiv.org

Target speech separation refers to extracting the target speaker's speech from mixed
signals. Despite the recent advances in deep learning based close-talk speech separation …

被引用次数：17 相关文章所有 3 个版本

[PDF] github.io

Recurrent deep stacking networks for supervised speech separation

ZQ Wang, DL Wang - 2017 IEEE International Conference on …, 2017 - ieeexplore.ieee.org

Supervised speech separation algorithms seldom utilize output patterns. This study
proposes a novel recurrent deep stacking approach for time-frequency masking based …

被引用次数：38 相关文章所有 3 个版本

[PDF] arxiv.org

Time-domain speech extraction with spatial information and multi speaker conditioning mechanism

J Zhang, C Zorilă, R Doddipatla… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

In this paper, we present a novel multi-channel speech extraction system to simultaneously
extract multiple clean individual sources from a mixture in noisy and reverberant …

被引用次数：14 相关文章所有 4 个版本

[PDF] arxiv.org

A comprehensive study of speech separation: spectrogram vs waveform separation

F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu… - arXiv preprint arXiv …, 2019 - arxiv.org

Speech separation has been studied widely for single-channel close-talk microphone
recordings over the past few years; developed solutions are mostly in frequency-domain …

被引用次数：91 相关文章所有 7 个版本

[PDF] arxiv.org

Multi-channel narrow-band deep speech separation with full-band permutation invariant training

C Quan, X Li - … 2022-2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org

This paper addresses the problem of multi-channel multi-speech separation based on deep
learning techniques. In the short time Fourier transform domain, we propose an end-to-end …

被引用次数：15 相关文章所有 4 个版本

[PDF] arxiv.org

Effective low-cost time-domain audio separation using globally attentive locally recurrent networks

MWY Lam, J Wang, D Su, D Yu - 2021 IEEE Spoken Language …, 2021 - ieeexplore.ieee.org

Recent research on the time-domain audio separation networks (TasNets) has brought great
success to speech separation. Nevertheless, conventional TasNets struggle to satisfy the …

被引用次数：31 相关文章所有 6 个版本

[PDF] xiaolei-zhang.net

[PDF][PDF] Multi-resolution stacking for speech separation based on boosted DNN

XL Zhang, DL Wang - Sixteenth Annual Conference of the …, 2015 - xiaolei-zhang.net

Recent progress in speech separation shows that deep neural networks (DNN) based
supervised methods can improve the performance in difficult noise conditions and exhibit …

被引用次数：11 相关文章所有 4 个版本

[PDF] arxiv.org

Unsupervised training for deep speech source separation with Kullback-Leibler divergence based probabilistic loss function

M Togami, Y Masuyama, T Komatsu… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

In this paper, we propose a multi-channel speech source separation method with a deep
neural network (DNN) which is trained under the condition that no clean signal is available …

被引用次数：30 相关文章所有 4 个版本

[PDF] mdpi.com

Effective Monoaural Speech Separation through Convolutional Top-Down Multi-View Network

AN Aung, CW Liao, JW Hung - Future Internet, 2024 - mdpi.com

Speech separation, sometimes known as the “cocktail party problem”, is the process of
separating individual speech signals from an audio mixture that includes ambient noises …

被引用次数：1 相关文章所有 4 个版本

[PDF] ieee.org

Speaker-independent speech separation with deep attractor network

Y Luo, Z Chen, N Mesgarani - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org

Despite the recent success of deep learning for many speech processing tasks, single-
microphone, speaker-independent speech separation remains challenging for two main …

被引用次数：287 相关文章所有 6 个版本

高级搜索

QQ 群

Temporal-spatial neural filter: Direction informed end-to-end multi-channel target speech separation

Recurrent deep stacking networks for supervised speech separation

Time-domain speech extraction with spatial information and multi speaker conditioning mechanism

A comprehensive study of speech separation: spectrogram vs waveform separation

Multi-channel narrow-band deep speech separation with full-band permutation invariant training

Effective low-cost time-domain audio separation using globally attentive locally recurrent networks

[PDF][PDF] Multi-resolution stacking for speech separation based on boosted DNN

Unsupervised training for deep speech source separation with Kullback-Leibler divergence based probabilistic loss function

Effective Monoaural Speech Separation through Convolutional Top-Down Multi-View Network

Speaker-independent speech separation with deep attractor network

引用