Supervised speech separation based on deep learning: An overview

DL Wang, J Chen - IEEE/ACM transactions on audio, speech …, 2018 - ieeexplore.ieee.org
Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …
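The snippet frames separation as a supervised learning problem; a common training target in that literature is a time-frequency mask. Below is a minimal NumPy/SciPy sketch of computing an ideal ratio mask (IRM) from parallel speech and noise signals and applying it to the mixture. The signal names and STFT parameters are illustrative assumptions, not taken from the paper.

```python
import numpy as np
from scipy.signal import stft, istft

def ideal_ratio_mask(speech, noise, fs=16000, nperseg=512):
    """Compute an ideal ratio mask (IRM) from separate speech and noise signals
    and apply it to the mixture spectrogram to recover a speech estimate.
    In the supervised setting, a network is trained to predict this mask
    from the mixture alone."""
    _, _, S = stft(speech, fs=fs, nperseg=nperseg)            # clean speech STFT
    _, _, N = stft(noise, fs=fs, nperseg=nperseg)             # noise STFT
    _, _, Y = stft(speech + noise, fs=fs, nperseg=nperseg)    # mixture STFT

    # IRM: speech energy divided by total energy in each time-frequency bin
    irm = np.abs(S) ** 2 / (np.abs(S) ** 2 + np.abs(N) ** 2 + 1e-12)

    # Apply the mask to the mixture and reconstruct a waveform estimate
    _, speech_est = istft(irm * Y, fs=fs, nperseg=nperseg)
    return irm, speech_est
```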

A review of blind source separation methods: two converging routes to ILRMA originating from ICA and NMF

H Sawada, N Ono, H Kameoka, D Kitamura… - … Transactions on Signal …, 2019 - cambridge.org
This paper describes several important methods for the blind source separation of audio
signals in an integrated manner. Two historically developed routes are featured. One started …
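One of the two routes the paper traces (NMF) can be illustrated with the classic multiplicative-update rules on a magnitude spectrogram. The NumPy sketch below minimizes the Euclidean cost and is a generic textbook NMF, not the ILRMA algorithm itself; rank and iteration count are arbitrary assumptions.

```python
import numpy as np

def nmf(V, rank=8, n_iter=200, eps=1e-12):
    """Factor a non-negative magnitude spectrogram V (freq x time) as V ~ W @ H
    using Lee-Seung multiplicative updates for the Euclidean cost."""
    rng = np.random.default_rng(0)
    F, T = V.shape
    W = rng.random((F, rank)) + eps   # spectral basis vectors
    H = rng.random((rank, T)) + eps   # temporal activations
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # update activations
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # update bases
    return W, H
```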

A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …
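Many of the networks surveyed take classical spatial features as input; a common choice is the generalized cross-correlation with phase transform (GCC-PHAT) between a microphone pair. A short NumPy sketch follows; the sampling rate and function name are assumptions for illustration.

```python
import numpy as np

def gcc_phat(x1, x2, fs=16000, max_tau=None):
    """Estimate the time difference of arrival between two microphone signals
    using GCC-PHAT (cross-power spectrum normalized to unit magnitude)."""
    n = len(x1) + len(x2)
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X1 * np.conj(X2)
    cc = np.fft.irfft(cross / (np.abs(cross) + 1e-12), n=n)
    max_shift = n // 2 if max_tau is None else min(int(fs * max_tau), n // 2)
    # Re-centre so that lag 0 sits in the middle of the correlation vector
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    tau = (np.argmax(np.abs(cc)) - max_shift) / fs   # estimated delay in seconds
    return tau, cc
```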

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org
Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …
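A recurring point in this review is the choice of input representation; log-mel spectrograms are a widely used front-end across speech, music, and environmental sound tasks. A brief sketch using librosa is given below; the parameter values are illustrative assumptions.

```python
import librosa
import numpy as np

def log_mel(path, sr=16000, n_mels=64, n_fft=1024, hop_length=256):
    """Load an audio file and compute a log-scaled mel spectrogram,
    a typical input feature for CNN/RNN audio models."""
    y, sr = librosa.load(path, sr=sr)
    mel = librosa.feature.melspectrogram(
        y=y, sr=sr, n_fft=n_fft, hop_length=hop_length, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)   # shape: (n_mels, n_frames)
```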

Machine learning in acoustics: Theory and applications

MJ Bianco, P Gerstoft, J Traer, E Ozanich… - The Journal of the …, 2019 - pubs.aip.org
Acoustic data provide scientific and engineering insights in fields ranging from biology and
communications to ocean and Earth science. We survey the recent advances and …

Wave-u-net: A multi-scale neural network for end-to-end audio source separation

D Stoller, S Ewert, S Dixon - arXiv preprint arXiv:1806.03185, 2018 - arxiv.org
Models for audio source separation usually operate on the magnitude spectrum, which
ignores phase information and makes separation performance dependent on hyper …
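The snippet describes separating directly on the raw waveform with a multi-scale architecture. Below is a heavily simplified PyTorch sketch of that idea (1-D convolutions with downsampling, upsampling, and skip connections between matching scales); the layer sizes are assumptions and this is not the authors' exact model.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyWaveUNet(nn.Module):
    """A much-reduced Wave-U-Net-style model: 1-D convolutions on the raw
    waveform, decimation/interpolation by 2, and skip connections."""
    def __init__(self, channels=24):
        super().__init__()
        self.down1 = nn.Conv1d(1, channels, kernel_size=15, padding=7)
        self.down2 = nn.Conv1d(channels, channels * 2, kernel_size=15, padding=7)
        self.bottleneck = nn.Conv1d(channels * 2, channels * 2, kernel_size=15, padding=7)
        self.up2 = nn.Conv1d(channels * 4, channels, kernel_size=5, padding=2)
        self.up1 = nn.Conv1d(channels * 2, 1, kernel_size=5, padding=2)

    def forward(self, x):                                # x: (batch, 1, samples), samples divisible by 4
        d1 = torch.relu(self.down1(x))
        d2 = torch.relu(self.down2(d1[:, :, ::2]))       # decimate by 2
        b = torch.relu(self.bottleneck(d2[:, :, ::2]))   # decimate by 2 again
        u2 = F.interpolate(b, scale_factor=2, mode='linear')
        u2 = torch.relu(self.up2(torch.cat([u2, d2], dim=1)))   # skip connection
        u1 = F.interpolate(u2, scale_factor=2, mode='linear')
        return self.up1(torch.cat([u1, d1], dim=1))      # estimated source waveform
```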

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …
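The simplest member of the multimicrophone family covered in this overview is the delay-and-sum beamformer. A short frequency-domain NumPy sketch follows; the array geometry, steering delays, and variable names are illustrative assumptions.

```python
import numpy as np

def delay_and_sum(X, mic_delays, freqs):
    """Frequency-domain delay-and-sum beamformer.
    X:          STFT of the array signals, shape (mics, freq_bins, frames)
    mic_delays: per-microphone propagation delays toward the target, in seconds
    freqs:      centre frequency of each bin, in Hz
    Returns the beamformed STFT of shape (freq_bins, frames)."""
    mics = X.shape[0]
    # Steering vector: the modelled propagation phase toward the target
    steer = np.exp(-2j * np.pi * np.outer(mic_delays, freqs))   # (mics, freq_bins)
    # w^H X with w = steer / mics: aligns the target across microphones and averages
    return np.einsum('mf,mft->ft', steer.conj(), X) / mics
```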

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & …, 2017 - Elsevier
Speech enhancement and automatic speech recognition (ASR) are most often evaluated in
matched (or multi-condition) settings where the acoustic conditions of the training data …
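The data-simulation mismatch studied here stems from how noisy training mixtures are generated. A minimal sketch of one common recipe (convolve clean speech with a room impulse response, then add background noise at a chosen SNR) is shown below; the SNR definition and variable names are assumptions, not the paper's exact protocol.

```python
import numpy as np
from scipy.signal import fftconvolve

def simulate_mixture(speech, rir, noise, snr_db):
    """Create a simulated noisy/reverberant utterance: convolve clean speech
    with a room impulse response, then add noise scaled to the requested SNR.
    Assumes the noise signal is at least as long as the speech signal."""
    reverberant = fftconvolve(speech, rir)[:len(speech)]
    noise = noise[:len(reverberant)]
    speech_power = np.mean(reverberant ** 2)
    noise_power = np.mean(noise ** 2) + 1e-12
    # Choose gain so that 10*log10(speech_power / (gain**2 * noise_power)) == snr_db
    gain = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return reverberant + gain * noise
```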

Self-supervised moving vehicle tracking with stereo sound

C Gan, H Zhao, P Chen, D Cox… - Proceedings of the …, 2019 - openaccess.thecvf.com
Humans are able to localize objects in the environment using both visual and auditory cues,
integrating information from multiple modalities into a common reference frame. We …

Improved speech enhancement with the wave-u-net

C Macartney, T Weyde - arXiv preprint arXiv:1811.11307, 2018 - arxiv.org
We study the use of the Wave-U-Net architecture for speech enhancement, a model
introduced by Stoller et al. for the separation of music vocals and accompaniment. This end …