- 学术资源搜索

UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures

ZQ Wang, S Watanabe - Advances in Neural Information …, 2024 - proceedings.neurips.cc

In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …

被引用次数：11 相关文章所有 8 个版本

[PDF] arxiv.org

Speech separation with pretrained frontend to minimize domain mismatch

W Wang, Z Pan, X Li, S Wang… - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org

Speech separation seeks to separate individual speech signals from a speech mixture.
Typically, most separation models are trained on synthetic data due to the unavailability of …

被引用次数：3 相关文章所有 5 个版本

[PDF] arxiv.org

Speech separation with large-scale self-supervised learning

Z Chen, N Kanda, J Wu, Y Wu, X Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Self-supervised learning (SSL) methods such as WavLM have shown promising speech
separation (SS) results in small-scale simulation-based experiments. In this work, we extend …

被引用次数：13 相关文章所有 3 个版本

Neural speech enhancement with unsupervised pre-training and mixture training

X Hao, C Xu, L Xie - Neural Networks, 2023 - Elsevier

Supervised neural speech enhancement methods always require a large scale of paired
noisy and clean speech data. Since collecting adequate paired data from real-world …

被引用次数：15 相关文章所有 4 个版本

[PDF] arxiv.org

Self-remixing: Unsupervised speech separation via separation and remixing

K Saijo, T Ogawa - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

We present Self-Remixing, a novel self-supervised speech separation method, which refines
a pre-trained separation model in an unsupervised manner. Self-Remixing consists of a …

被引用次数：8 相关文章所有 5 个版本

[PDF] arxiv.org

PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings

J Kalda, R Marxer, T Alumäe, H Bredin - arXiv preprint arXiv:2403.02288, 2024 - arxiv.org

A major drawback of supervised speech separation (SSep) systems is their reliance on
synthetic data, leading to poor real-world generalization. Mixture invariant training (MixIT) …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Efficient personalized speech enhancement through self-supervised learning

A Sivaraman, M Kim - IEEE Journal of Selected Topics in Signal …, 2022 - ieeexplore.ieee.org

This work presents self-supervised learning methods for monaural speaker-specific (ie,
personalized) speech enhancement models. While general-purpose models must broadly …

被引用次数：23 相关文章所有 5 个版本

[PDF] arxiv.org

Unsupervised multi-channel separation and adaptation

C Han, K Wilson, S Wisdom… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

A key challenge in machine learning is to generalize from training data to an application
domain of interest. This work extends the recently-proposed mixture invariant training (MixIT) …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Reverberation as Supervision for Speech Separation

R Aralikatti, C Boeddeker, G Wichern… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

This paper proposes reverberation as supervision (RAS), a novel unsupervised loss function
for single-channel reverberant speech separation. Prior methods for unsupervised …

被引用次数：6 相关文章所有 3 个版本

[PDF] isca-archive.org

[PDF][PDF] Using semi-supervised learning for monaural time-domain speech separation with a self-supervised learning-based si-snr estimator

S Dang, T Matsumoto, Y Takeuchi, H Kudo - Interspeech 2023., 2023 - isca-archive.org

Speech separation aims to decompose mixed speeches into independent signals. Prior
research on monaural time-domain speech separation has made great progress in …

被引用次数：7 相关文章所有 3 个版本

高级搜索

QQ 群

UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures

Speech separation with pretrained frontend to minimize domain mismatch

Speech separation with large-scale self-supervised learning

Neural speech enhancement with unsupervised pre-training and mixture training

Self-remixing: Unsupervised speech separation via separation and remixing

PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings

Efficient personalized speech enhancement through self-supervised learning

Unsupervised multi-channel separation and adaptation

Reverberation as Supervision for Speech Separation

[PDF][PDF] Using semi-supervised learning for monaural time-domain speech separation with a self-supervised learning-based si-snr estimator

引用