Speaker-independent speech separation with deep attractor network

Y Luo, Z Chen, N Mesgarani - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org
Despite the recent success of deep learning for many speech processing tasks, single-
microphone, speaker-independent speech separation remains challenging for two main …

Conditioned-U-Net: Introducing a control mechanism in the U-Net for multiple source separations

G Meseguer-Brocal, G Peeters - arXiv preprint arXiv:1907.01277, 2019 - arxiv.org
Data-driven models for audio source separation such as U-Net or Wave-U-Net are usually
models dedicated to and specifically trained for a single task, eg a particular instrument …

Deep filtering: Signal extraction and reconstruction using complex time-frequency filters

W Mack, EAP Habets - IEEE Signal Processing Letters, 2019 - ieeexplore.ieee.org
Signal extraction from a single-channel mixture with additional undesired signals is most
commonly performed using time-frequency (TF) masks. Typically, the mask is estimated with …

Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network

S Routray, Q Mao - Computer Speech & Language, 2022 - Elsevier
We propose PSMGAN, an efficient phase sensitive masking-based single-channel speech
enhancement technique using a conditional generative adversarial network (cGAN). The …

Phase retrieval with Bregman divergences and application to audio signal recovery

PH Vial, P Magron, T Oberlin… - IEEE Journal of Selected …, 2021 - ieeexplore.ieee.org
Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products.
This problem arises in many audio signal processing applications which operate on a short …

Time–frequency masking based supervised speech enhancement framework using fuzzy deep belief network

S Samui, I Chakrabarti, SK Ghosh - Applied Soft Computing, 2019 - Elsevier
In recent years, deep learning based supervised speech enhancement methods have
gained a considerable amount of research attention over the statistical signal processing …

Speech dereverberation with context-aware recurrent neural networks

JF Santos, TH Falk - IEEE/ACM Transactions on Audio, Speech …, 2018 - ieeexplore.ieee.org
In this paper, we propose a model to perform speech dereverberation by estimating its
spectral magnitude from the reverberant counterpart. Our models are capable of extracting …

Consistent independent low-rank matrix analysis for determined blind source separation

D Kitamura, K Yatabe - EURASIP journal on advances in signal …, 2020 - Springer
Independent low-rank matrix analysis (ILRMA) is the state-of-the-art algorithm for blind
source separation (BSS) in the determined situation (the number of microphones is greater …

[PDF][PDF] Single-Channel Dereverberation Using Direct MMSE Optimization and Bidirectional LSTM Networks.

W Mack, S Chakrabarty, FR Stöter, S Braun… - …, 2018 - isca-archive.org
Dereverberation is useful in hands-free communication and voice controlled devices for
distant speech acquisition. Singlechannel dereverberation can be achieved by applying a …

Performance analysis of various training targets for improving speech quality and intelligibility

S Sivapatham, A Kar, R Ramadoss - Applied Acoustics, 2021 - Elsevier
Denoising a single-channel speech (recorded using one microphone) remains an open
problem in many speech-related applications. Recently, supervised deep learning methods …