ZQ Wang - arXiv preprint arXiv:2402.00820, 2024 - arxiv.org
In reverberant conditions with a single speaker, each far-field microphone records a reverberant version of the same speaker signal at a different location. In over-determined …
L Li, S Seki - ICASSP 2024-2024 IEEE International Conference …, 2024 - ieeexplore.ieee.org
This paper proposes a domain adaptation method for speech enhancement called Remixed2Remixed. The proposed method adopts Noise2Noise (N2N) learning to adapt …
Reverberation as supervision (RAS) is a framework that allows for training monaural speech separation models from multi-channel mixtures in an unsupervised manner. In RAS, models …
Z Zhang, R Han, Z Wang, X Xia, Y Xiao… - Proc. CHiME …, 2023 - chimechallenge.org
This paper describes the NWPU-ByteAudio system for CHiME-7 Task 2-unsupervised domain adaptation for conversational speech enhancement (UDASE). To better make use of …
K Saijo, T Ogawa - arXiv preprint arXiv:2309.00376, 2023 - isca-archive.org
We propose an unsupervised approach for training separation models from scratch using RemixIT and Self-Remixing, which are recently proposed self-supervised learning methods …
L Li, S Seki - arXiv preprint arXiv:2406.13982, 2024 - arxiv.org
RemixIT and Remixed2Remixed are domain adaptation-based speech enhancement (DASE) methods that use a teacher model trained in full supervision to generate pseudo …
* Speech corpus compilation in the era of the internet.** Shinnosuke Takamichi (University of Tokyo, Tokyo, 113–8656) e-mail: shinnosuke takamichi@ ipc. iutokyo. ac. jp [doi …