查看文章

hal.science 中的 [PDF]

Cauchy multichannel speech enhancement with a deep speech prior

作者

Mathieu Fontaine, Aditya Arie Nugraha, Roland Badeau, Kazuyoshi Yoshii, Antoine Liutkus

发表日期

2019/9/2

研讨会论文

2019 27th European Signal Processing Conference (EUSIPCO)

页码范围

1-5

出版商

IEEE

简介

We propose a semi-supervised multichannel speech enhancement system based on a probabilistic model which assumes that both speech and noise follow the heavy-tailed multi-variate complex Cauchy distribution. As we advocate, this allows handling strong and adverse noisy conditions. Consequently, the model is parameterized by the source magnitude spectrograms and the source spatial scatter matrices. To deal with the non-additivity of scatter matrices, our first contribution is to perform the enhancement on a projected space. Then, our second contribution is to combine a latent variable model for speech, which is trained by following the variational autoencoder framework, with a low-rank model for the noise source. At test time, an iterative inference algorithm is applied, which produces estimated parameters to use for separation. The speech latent variables are estimated first from the noisy speech and then …

引用总数

被引用次数：6

2020202120222 1 3

学术搜索中的文章

Cauchy multichannel speech enhancement with a deep speech prior

M Fontaine, AA Nugraha, R Badeau, K Yoshii… - 2019 27th European Signal Processing Conference …, 2019

被引用次数：6 相关文章所有 10 个版本