A survey of audio enhancement algorithms for music, speech, bioacoustics, biomedical, industrial and environmental sounds by image U-Net

S Gul, MS Khan - IEEE Access, 2023 - ieeexplore.ieee.org
The recent surge in the use of Deep Neural Networks (DNNs) has also made its mark in the
field of Audio Enhancement (AE), providing much better quality than the classical methods …

Blind source separation

GR Naik, W Wang - Berlin: Springer, 2014 - Springer
Blind source separation (BSS) methods have received extensive attention over the past two
decades; thanks to its wide applicability in a number of areas such as biomedical …

A blind source separation framework for ego-noise reduction on multi-rotor drones

L Wang, A Cavallaro - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Acoustic sensing from a multi-rotor drone is heavily degraded by the strong ego-noise
produced by the rotating motors and propellers. To address this problem, we propose a …

Over-determined source separation and localization using distributed microphones

L Wang, JD Reiss, A Cavallaro - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
We propose an overdetermined source separation and localization method for a set of M
microphones distributed around an unknown number, N < M, of sources. We reformulate the …

An iterative approach to source counting and localization using two distant microphones

L Wang, TK Hon, JD Reiss… - IEEE/ACM transactions …, 2016 - ieeexplore.ieee.org
We propose a time difference of arrival (TDOA) estimation framework based on time-
frequency inter-channel phase difference (IPD) to count and localize multiple acoustic …

Correlation maximization-based sampling rate offset estimation for distributed microphone arrays

L Wang, S Doclo - IEEE/ACM Transactions on Audio, Speech …, 2016 - ieeexplore.ieee.org
In this paper, we investigate the sampling rate mismatch problem in distributed microphone
arrays and propose a correlation maximization algorithm to blindly estimate the sampling …

A proposed method to improve the WER of an ASR system in the noisy reverberant room

ME Sadeghi, H Sheikhzadeh, MJ Emadi - Journal of the Franklin Institute, 2024 - Elsevier
This paper proposes a novel approach to reducing the word error rate (WER) of an
automatic speech recognition (ASR) system in a noisy reverberant room. This research …

Noise power spectral density estimation using MaxNSR blocking matrix

L Wang, T Gerkmann, S Doclo - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
In this paper, a multi-microphone noise reduction system based on the generalized sidelobe
canceller (GSC) structure is investigated. The system consists of a fixed beamformer …

Multichannel blind music source separation using directivity-aware MNMF with harmonicity constraints

AJ Muñoz-Montoro, JJ Carabias-Orti… - IEEE …, 2022 - ieeexplore.ieee.org
In this paper we present a harmonic constrained Multichannel Non-Negative Matrix
Factorization (MNMF) method for the task of blind music source separation. In this model, the …

Systems, methods, apparatus, and computer-readable media for far-field multi-source tracking and separation

E Visser - US Patent 9,100,734, 2015 - Google Patents
An apparatus for multichannel signal processing separates signal components from different
acoustic sources by initializing a separation filter bank with beams in the estimated Source …