相关文章- 学术资源搜索

[PDF][PDF] Monaural speech separation method based on recurrent attention with parallel branches

X Yang, C Bao, X Zhang, X Chen - Proc. Interspeech, 2023 - drive.google.com

In many speech separation methods, the contextual information contained in the feature
sequence is mainly modeled by recurrent layer and/or self-attention mechanism. However …

被引用次数：3 相关文章所有 3 个版本

Single-channel speech extraction using speaker inventory and attention network

X Xiao, Z Chen, T Yoshioka, H Erdogan… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

Neural network-based speech separation has received a surge of interest in recent years.
Previously proposed methods either are speaker independent or extract a target speaker's …

被引用次数：73 相关文章

[PDF] arxiv.org

Librimix: An open-source dataset for generalizable speech separation

J Cosentino, M Pariente, S Cornell, A Deleforge… - arXiv preprint arXiv …, 2020 - arxiv.org

In recent years, wsj0-2mix has become the reference dataset for single-channel speech
separation. Most deep learning-based speech separation models today are benchmarked …

被引用次数：234 相关文章所有 5 个版本

[PDF] ieee.org

Monaural speech separation using dual-output deep neural network with multiple joint constraint

S Linhui, L Wenqing, Z Meng… - Chinese Journal of …, 2023 - ieeexplore.ieee.org

Monaural speech separation is a significant research field in speech signal processing. To
achieve a better separation performance, we propose three novel joint-constraint loss …

被引用次数：3 相关文章所有 2 个版本

[PDF] polimi.it

Real-time multichannel speech separation and enhancement using a beamspace-domain-based lightweight CNN

M Olivieri, L Comanducci, M Pezzoli… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

The problems of speech separation and enhancement concern the extraction of the speech
emitted by a target speaker when placed in a scenario where multiple interfering speakers …

被引用次数：6 相关文章所有 4 个版本

[PDF] arxiv.org

FurcaNeXt: End-to-end monaural speech separation with dynamic gated dilated temporal convolutional networks

L Zhang, Z Shi, J Han, A Shi, D Ma - … , Daejeon, South Korea, January 5–8 …, 2020 - Springer

Deep dilated temporal convolutional networks (TCN) have been proved to be very effective
in sequence modeling. In this paper we propose several improvements of TCN for end-to …

被引用次数：105 相关文章所有 4 个版本

[PDF] ieee.org

Graph convolution-based deep clustering for speech separation

S Qin, T Jiang, S Wu, N Wang, X Zhao - IEEE Access, 2020 - ieeexplore.ieee.org

Deep clustering is a promising technique for speech separation that is crucial to speech
communication, acoustic target detection, acoustic enhancement and speech recognition. In …

被引用次数：10 相关文章所有 3 个版本

[PDF] kent.ac.uk

Source-aware context network for single-channel multi-speaker speech separation

ZX Li, Y Song, LR Dai… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org

Deep learning based approaches have achieved promising performance in speaker-
dependent single-channel multispeaker speech separation. However, partly due to the label …

被引用次数：12 相关文章所有 6 个版本

[PDF] arxiv.org

Time-domain speaker extraction network

C Xu, W Rao, ES Chng, H Li - 2019 IEEE Automatic Speech …, 2019 - ieeexplore.ieee.org

Speaker extraction is to extract a target speaker's voice from multi-talker speech. It simulates
humans' cocktail party effect or the selective listening ability. The prior work mostly performs …

被引用次数：58 相关文章所有 7 个版本

[PDF] arxiv.org

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Z Pan, G Wichern, Y Masuyama… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

Target speech extraction aims to extract, based on a given conditioning cue, a target speech
signal that is corrupted by interfering sources, such as noise or competing speakers …

被引用次数：1 相关文章所有 6 个版本

高级搜索

QQ 群

[PDF][PDF] Monaural speech separation method based on recurrent attention with parallel branches

Single-channel speech extraction using speaker inventory and attention network

Librimix: An open-source dataset for generalizable speech separation

Monaural speech separation using dual-output deep neural network with multiple joint constraint

Real-time multichannel speech separation and enhancement using a beamspace-domain-based lightweight CNN

FurcaNeXt: End-to-end monaural speech separation with dynamic gated dilated temporal convolutional networks

Graph convolution-based deep clustering for speech separation

Source-aware context network for single-channel multi-speaker speech separation

Time-domain speaker extraction network

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

引用