[PDF] from hal.science

Multichannel audio source separation with deep neural networks

Authors

Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent

Publication date

2015/6/12

Report number

RR-8740

Publisher

INRIA

Description

This article addresses the problem of multichannel audio source separation. We propose a framework where deep neural networks (DNNs) are used to model the source spectra and combined with the classical multichannel Gaussian model to exploit the spatial information. The parameters are estimated in an iterative expectation-maximization (EM) fashion and used to derive a multichannel Wiener filter. We present an extensive experimental study to show the impact of different design choices on the performance of the proposed technique. We consider different cost functions for the training of DNNs, namely the probabilistically motivated Itakura-Saito divergence, and also Kullback-Leibler, Cauchy, mean squared error, and phase-sensitive cost functions. We also study the number of EM iterations and the use of multiple DNNs, where each DNN aims to improve the spectra estimated by the preceding EM iteration …
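The two ingredients named in the description, the Itakura-Saito cost and the multichannel Wiener filter under a Gaussian model, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `itakura_saito` and `multichannel_wiener_filter`, the array shapes, and the toy values are all assumptions made for illustration. The sketch treats the source power spectra `v` (which a DNN would supply in the proposed framework) and the spatial covariance matrices `R` as given, and omits the EM updates entirely.

```python
import numpy as np


def itakura_saito(v_true, v_est, eps=1e-12):
    """Itakura-Saito divergence between two nonnegative power spectra."""
    v_true = np.asarray(v_true, dtype=float)
    v_est = np.asarray(v_est, dtype=float)
    ratio = (v_true + eps) / (v_est + eps)
    return float(np.sum(ratio - np.log(ratio) - 1.0))


def multichannel_wiener_filter(x, v, R):
    """Estimate the source spatial images in one time-frequency bin.

    x: (I,) complex mixture STFT coefficients over I channels.
    v: (J,) nonnegative power spectra of the J sources (assumed given).
    R: (J, I, I) Hermitian spatial covariance matrices (assumed given).
    Returns a (J, I) complex array, one image estimate per source.
    """
    # Covariance of each source image under the Gaussian model: v_j * R_j.
    source_cov = v[:, None, None] * R
    # The mixture covariance is the sum of the source covariances.
    mix_cov = source_cov.sum(axis=0)
    # Apply W_j = (v_j R_j) (sum_k v_k R_k)^{-1} to x without forming the inverse.
    y = np.linalg.solve(mix_cov, x)
    return source_cov @ y


# Toy stereo example with two sources (all values are made up).
x = np.array([1.0 + 1.0j, 0.5 - 0.2j])
v = np.array([2.0, 0.5])
R = np.stack([np.eye(2), np.array([[1.0, 0.3], [0.3, 1.0]])]).astype(complex)
c = multichannel_wiener_filter(x, v, R)
```

One property of this filter is that the estimated source images add back up to the mixture, so `c.sum(axis=0)` matches `x` up to rounding. The Itakura-Saito divergence is zero when the two spectra are equal and positive otherwise.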

Total citations

Cited by 355

Citations per year: 2016: 8; 2017: 37; 2018: 70; 2019: 53; 2020: 54; 2021: 52; 2022: 37; 2023: 31; 2024: 11

Scholar articles

Multichannel audio source separation with deep neural networks

AA Nugraha, A Liutkus, E Vincent - IEEE/ACM Transactions on Audio, Speech, and …, 2016

Cited by 355, Related articles, All 15 versions