UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures

ZQ Wang, S Watanabe - Advances in Neural Information …, 2024 - proceedings.neurips.cc
In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …

面向卷积混叠环境下的盲源分离新方法

解元, 邹涛, 孙为军, 谢胜利 - 自动化学报, 2023 - aas.net.cn
卷积混叠环境下的盲源分离(Blind source separation, BSS) 是一个极具挑战性和实际意义的
问题. 本文在独立分量分析框架下, 建立非负矩阵分解(Nonnegative matrix factorization, NMF) …

[PDF][PDF] Weakly-Supervised Neural Full-Rank Spatial Covariance Analysis for a Front-End System of Distant Speech Recognition.

Y Bando, T Aizawa, K Itoyama, K Nakadai - Interspeech, 2022 - isca-archive.org
This paper presents a weakly-supervised multichannel neural speech separation method for
distant speech recognition (DSR) of real conversational speech mixtures. A blind source …

USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering

ZQ Wang - arXiv preprint arXiv:2402.00820, 2024 - arxiv.org
In reverberant conditions with a single speaker, each far-field microphone records a
reverberant version of the same speaker signal at a different location. In over-determined …

Location as supervision for weakly supervised multi-channel source separation of machine sounds

R Falcon-Perez, G Wichern… - 2023 IEEE Workshop …, 2023 - ieeexplore.ieee.org
In this work, we are interested in learning a model to separate sources that cannot be
recorded in isolation, such as parts of a machine that must run simultaneously in order for …

Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation

Y Bando, Y Masuyama, AA Nugraha… - 2023 31st European …, 2023 - ieeexplore.ieee.org
This paper describes an efficient unsupervised learning method for a neural source
separation model that utilizes a probabilistic generative model of observed multichannel …

Joint separation and localization of moving sound sources based on neural full-rank spatial covariance analysis

H Munakata, Y Bando, R Takeda… - IEEE Signal …, 2023 - ieeexplore.ieee.org
This paper presents an unsupervised multichannel method that can separate moving sound
sources based on an amortized variational inference (AVI) of joint separation and …

FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures

L Li, H Kameoka, S Makino - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
This article proposes a new source model and training scheme to improve the accuracy and
speed of the multichannel variational autoencoder (MVAE) method. The MVAE method is a …

Neural Blind Source Separation and Diarization for Distant Speech Recognition

Y Bando, T Nakamura, S Watanabe - arXiv preprint arXiv:2406.08396, 2024 - arxiv.org
This paper presents a neural method for distant speech recognition (DSR) that jointly
separates and diarizes speech mixtures without supervision by isolated signals. A standard …

Mixcycle: unsupervised speech separation via cyclic mixture permutation invariant training

E Karamatlı, S Kırbız - IEEE Signal Processing Letters, 2022 - ieeexplore.ieee.org
We introduce two unsupervised source separation methods, which involve self-supervised
training from single-channel two-source speech mixtures. Our first method, mixture …