Joint speaker separation and recognition using non-negative matrix deconvolution with adaptive dictionary

S Drgas, T Virtanen - Computer Speech & Language, 2021 - Elsevier
non-negative matrix deconvolution (NMD) which models spectrogram matrix as a linear
combination of dictionary … While the speaker recognition accuracy is lower for the new approach, …

Limited-data automatic speaker verification algorithm using band-limitedphase-only correlation function

Á PEDROZA, JDELA ROSA… - Turkish Journal of …, 2019 - journals.tubitak.gov.tr
… In this sense, with the aim of validating its use as a new limited-data automatic speaker
verification technique, this paper proposes to use the BLPOC function as a simple matching …

Switching Divergences for Spectral Learning in Blind Speech Dereverberation

FJ Ibarrola, RD Spies… - IEEE/ACM Transactions on …, 2019 - ieeexplore.ieee.org
… Some of these methods make use of non-negativespectrogram. In order to evaluate whether
a given parameter β1 is good for dictionary building, we take a reverberant spectrogram Y , …

Group sparse representation with wavenet vocoder adaptation for spectrum and prosody conversion

B Sisman, M Zhang, H Li - IEEE/ACM Transactions on Audio …, 2019 - ieeexplore.ieee.org
… As a solution to the limited training data problem, nonnegative … (2) decomposes an source
spectrogram into A × ˆH where ˆH … all speaker’s voice into PPGs, and construct dictionaries that …

Automatic sub-word unit discovery and pronunciation lexicon induction for automatic speech recognition with application to under-resourced languages

W Agenbag - 2020 - scholar.sun.ac.za
… Mel spectrogram representation of the elite dictionary of sub-… Wang et al. consider both
normalized cut and non-negative … features for five-way speaker identification. In particular, they …

Dual transform based joint learning single channel speech separation using generative joint dictionary learning

MI Hossain, TH Al Mahmud, MS Islam… - Multimedia Tools and …, 2022 - Springer
… joint dictionaryspectrogram that prepares three parts like real, imaginary and magnitude
for each subband signal. Next, we utilize the GJDL approach for making the joint dictionaries, …

TAU-Net: Temporal activation U-net shared with nonnegative matrix factorization for speech enhancement in unseen noise environments

KM Jeon, GW Lee, NK Kim… - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
… 2D deconvolution layers to estimate the speech and noise … enhancement and noise-robust
speaker verification,” in Proc. … Venkataramani, “A neural network alternative to nonnegative

Combining F0 and non-negative constraint robust principal component analysis for singing voice separation

F Li, M Akagi - Signal Processing, 2020 - Elsevier
… To obtain the aforementioned reconstructed voice spectrogram E 0 from F0, we define …
showed better results than supervised methods which use online dictionary learning (LRR, LRRi, …

Single channel multi-talker speech separation with deep learning

C Xu - 2020 - dr.ntu.edu.sg
… first solution to solve the speaker verification problem when the test … the non-negative
spectrogram Y ∈ RF×T , as follows, … a basis dictionary by decomposing the clean spectrogram Sc …

Sparse pursuit and dictionary learning for blind source separation in polyphonic music recordings

S Schulze, EJ King - EURASIP Journal on Audio, Speech, and Music …, 2021 - Springer
… Smaragdis [27] introduced NMFD (non-negative matrix factor deconvolution), which is
NMF … of instruments inside the spectrogram, we need an algorithm to approximate a non-negative