A Deep Representation Learning-Based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder

Y Xiang, J Tian, X Hu, X Xu… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Generally, the performance of deep neural networks (DNNs) heavily depends on the quality
of data representation learning. Our preliminary work has emphasized the significance of …

Model-based noise PSD estimation from speech in non-stationary noise

JK Nielsen, MS Kavalekalam… - … , Speech and Signal …, 2018 - ieeexplore.ieee.org
Most speech enhancement algorithms need an estimate of the noise power spectral density
(PSD) to work. In this paper, we introduce a model-based framework for doing noise PSD …

A two-stage deep representation learning-based speech enhancement method using variational autoencoder and adversarial training

Y Xiang, JL Højvang, MH Rasmussen… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
This article focuses on leveraging deep representation learning (DRL) for speech
enhancement (SE). In general, the performance of the deep neural network (DNN) is heavily …

RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention

M Liu, Z Chen, X Yan, Y Lv, X Xia, C Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
In real-time speech communication systems, speech signals are often degraded by multiple
distortions. Recently, a two-stage Repair-and-Denoising network (RaD-Net) was proposed …

Enhancing Environmental Sound Recognition in Digital Simulations: A Novel Approach to Beamforming and Signal Identification

S Stroud, K Jones, G Edwards, C Robinson… - Proceedings of the …, 2024 - dl.acm.org
This paper advances the field of environmental sound recognition, presenting a refined
approach to beamforming and noise identification through digital simulations of realistic …

Developing Audio Zoom in Virtual Environments: Real-World Soundscapes and Targeted Noise Detection

S Stroud, KO Jones, G Edwards… - 2024 International …, 2024 - ieeexplore.ieee.org
This study presents an innovative beamforming method tailored for audio surveillance
applications, developed through virtual simulations conducted at Liverpool John Moores …

A speech enhancement algorithm based on a non-negative hidden Markov model and Kullback-Leibler divergence

Y Xiang, L Shi, JL Højvang, MH Rasmussen… - EURASIP Journal on …, 2022 - Springer
In this paper, we propose a supervised single-channel speech enhancement method that
combines Kullback-Leibler (KL) divergence-based non-negative matrix factorization (NMF) …

Robust Audio Zoom for Surveillance Systems: A Beamforming Approach with Reduced Microphone Array

S Stroud, KO Jones, G Edwards… - 2023 International …, 2023 - ieeexplore.ieee.org
This paper presents a delay and sum beamforming audio zoom method for addressing
broken microphones in video surveillance systems. The proposed approach utilises a …

Information: Α physical reality or a humanly tool? From the model order to the appropriate number of clusters

M Dendrinos - Journal of Integrated Information …, 2024 - ejournals.epublishing.ekt.gr
This paper is a presentation of two important types of information regarding natural signals
and groups of relative things. The measure of the first information type is the order of the …

[PDF][PDF] Data-driven Speech Enhancement: from Non-negative Matrix Factorization to Deep Representation Learning

Y Xiang - 2022 - vbn.aau.dk
In natural listening environments, speech signals are easily distorted by various acoustic
interference, which reduces the speech quality and intelligibility of human listening; …