UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures

ZQ Wang, S Watanabe - Advances in Neural Information …, 2024 - proceedings.neurips.cc
In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …

USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering

ZQ Wang - arXiv preprint arXiv:2402.00820, 2024 - arxiv.org
In reverberant conditions with a single speaker, each far-field microphone records a
reverberant version of the same speaker signal at a different location. In over-determined …

Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing

L Li, S Seki - ICASSP 2024-2024 IEEE International Conference …, 2024 - ieeexplore.ieee.org
This paper proposes a domain adaptation method for speech enhancement called
Remixed2Remixed. The proposed method adopts Noise2Noise (N2N) learning to adapt …

Enhanced reverberation as supervision for unsupervised speech separation

K Saijo, G Wichern, FG Germain, Z Pan… - arXiv preprint arXiv …, 2024 - arxiv.org
Reverberation as supervision (RAS) is a framework that allows for training monaural speech
separation models from multi-channel mixtures in an unsupervised manner. In RAS, models …

[PDF][PDF] The NWPU-ByteAudio System for CHiME-7 Task 2 UDASE Challenge

Z Zhang, R Han, Z Wang, X Xia, Y Xiao… - Proc. CHiME …, 2023 - chimechallenge.org
This paper describes the NWPU-ByteAudio system for CHiME-7 Task 2-unsupervised
domain adaptation for conversational speech enhancement (UDASE). To better make use of …

[PDF][PDF] Remixing-based Unsupervised Source Separation from Scratch

K Saijo, T Ogawa - arXiv preprint arXiv:2309.00376, 2023 - isca-archive.org
We propose an unsupervised approach for training separation models from scratch using
RemixIT and Self-Remixing, which are recently proposed self-supervised learning methods …

Improved Remixing Process for Domain Adaptation-Based Speech Enhancement by Mitigating Data Imbalance in Signal-to-Noise Ratio

L Li, S Seki - arXiv preprint arXiv:2406.13982, 2024 - arxiv.org
RemixIT and Remixed2Remixed are domain adaptation-based speech enhancement
(DASE) methods that use a teacher model trained in full supervision to generate pseudo …

インターネット時代の音声コーパスの作成

高道慎之介 - 日本音響学会誌, 2024 - jstage.jst.go.jp
* Speech corpus compilation in the era of the internet.** Shinnosuke Takamichi (University
of Tokyo, Tokyo, 113–8656) e-mail: shinnosuke takamichi@ ipc. iutokyo. ac. jp [doi …