Impact of phase estimation on single-channel speech separation based on time-frequency masking

Y Luo, Z Chen, N Mesgarani - IEEE/ACM Transactions on …, 2018 - ieeexplore.ieee.org

Despite the recent success of deep learning for many speech processing tasks, single-
microphone, speaker-independent speech separation remains challenging for two main …

被引用次数：291 相关文章所有 6 个版本

[PDF] arxiv.org

Conditioned-U-Net: Introducing a control mechanism in the U-Net for multiple source separations

G Meseguer-Brocal, G Peeters - arXiv preprint arXiv:1907.01277, 2019 - arxiv.org

Data-driven models for audio source separation such as U-Net or Wave-U-Net are usually
models dedicated to and specifically trained for a single task, eg a particular instrument …

被引用次数：74 相关文章所有 6 个版本

[PDF] arxiv.org

Deep filtering: Signal extraction and reconstruction using complex time-frequency filters

W Mack, EAP Habets - IEEE Signal Processing Letters, 2019 - ieeexplore.ieee.org

Signal extraction from a single-channel mixture with additional undesired signals is most
commonly performed using time-frequency (TF) masks. Typically, the mask is estimated with …

被引用次数：85 相关文章所有 4 个版本

Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network

S Routray, Q Mao - Computer Speech & Language, 2022 - Elsevier

We propose PSMGAN, an efficient phase sensitive masking-based single-channel speech
enhancement technique using a conditional generative adversarial network (cGAN). The …

被引用次数：27 相关文章所有 2 个版本

[PDF] arxiv.org

Phase retrieval with Bregman divergences and application to audio signal recovery

PH Vial, P Magron, T Oberlin… - IEEE Journal of Selected …, 2021 - ieeexplore.ieee.org

Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products.
This problem arises in many audio signal processing applications which operate on a short …

被引用次数：25 相关文章所有 12 个版本

Time–frequency masking based supervised speech enhancement framework using fuzzy deep belief network

S Samui, I Chakrabarti, SK Ghosh - Applied Soft Computing, 2019 - Elsevier

In recent years, deep learning based supervised speech enhancement methods have
gained a considerable amount of research attention over the statistical signal processing …

被引用次数：32 相关文章

[PDF] arxiv.org

Speech dereverberation with context-aware recurrent neural networks

JF Santos, TH Falk - IEEE/ACM Transactions on Audio, Speech …, 2018 - ieeexplore.ieee.org

In this paper, we propose a model to perform speech dereverberation by estimating its
spectral magnitude from the reverberant counterpart. Our models are capable of extracting …

被引用次数：39 相关文章所有 5 个版本

[PDF] springer.com

Consistent independent low-rank matrix analysis for determined blind source separation

D Kitamura, K Yatabe - EURASIP journal on advances in signal …, 2020 - Springer

Independent low-rank matrix analysis (ILRMA) is the state-of-the-art algorithm for blind
source separation (BSS) in the determined situation (the number of microphones is greater …

被引用次数：19 相关文章所有 15 个版本

[PDF] isca-archive.org

[PDF][PDF] Single-Channel Dereverberation Using Direct MMSE Optimization and Bidirectional LSTM Networks.

W Mack, S Chakrabarty, FR Stöter, S Braun… - …, 2018 - isca-archive.org

Dereverberation is useful in hands-free communication and voice controlled devices for
distant speech acquisition. Singlechannel dereverberation can be achieved by applying a …

被引用次数：21 相关文章所有 6 个版本

Performance analysis of various training targets for improving speech quality and intelligibility

S Sivapatham, A Kar, R Ramadoss - Applied Acoustics, 2021 - Elsevier

Denoising a single-channel speech (recorded using one microphone) remains an open
problem in many speech-related applications. Recently, supervised deep learning methods …

被引用次数：12 相关文章

高级搜索

QQ 群