查看文章

hal.science 中的 [PDF]

Deep neural network based multichannel audio source separation

作者

Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent

发表日期

2018

图书

Audio Source Separation

页码范围

157-185

出版商

Springer, Cham

简介

This chapter presents a multichannel audio source separation framework where deep neural networks (DNNs) are used to model the source spectra and combined with the classical multichannel Gaussian model to exploit the spatial information. The parameters are estimated in an iterative expectation-maximization (EM) fashion and used to derive a multichannel Wiener filter. Different design choices and their impact on the performance are discussed. They include the cost functions for DNN training, the number of parameter updates, the use of multiple DNNs, and the use of weighted parameter updates. Finally, we present its application to a speech enhancement task and a music separation task. The experimental results show the benefit of the multichannel DNN-based approach over a single-channel DNN-based approach and the multichannel nonnegative matrix factorization based iterative EM framework.

引用总数

被引用次数：16

20182019202020212022202320242 2 5 3 2 1 1

学术搜索中的文章

Deep neural network based multichannel audio source separation

AA Nugraha, A Liutkus, E Vincent - Audio Source Separation, 2018

被引用次数：16 相关文章所有 8 个版本