Speech enhancement using self-supervised pre-trained model and vector quantization

XY Zhao, QS Zhu, J Zhang - 2022 Asia-Pacific Signal and …, 2022 - ieeexplore.ieee.org
With the development of deep learning, neural network-based speech enhancement (SE)
models have shown excellent performance. Meanwhile, it was shown that the development …

Closed-Form Solution to the Multichannel Wiener Filter With Interaural Level Difference Preservation

DM Do Carmo, R Borsoi… - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
This article presents a multichannel Wiener filter (MWF) based noise reduction method with
preservation of the interaural level difference (ILD). It minimizes the MWF cost function …

A Study of Multichannel Spatiotemporal Features and Knowledge Distillation on Robust Target Speaker Extraction

Y Wang, J Zhang, S Chen, W Zhang… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Target speaker extraction (TSE) based on direction of arrival (DOA) has a wide range of
applications in, e.g., remote conferencing, hearing aids, and in-car speech interaction. Due to the …

Speech Enhancement with Multi-granularity Vector Quantization

X Zhao, Q Zhu, J Zhang, Y Zhou… - 2023 Asia Pacific Signal …, 2023 - ieeexplore.ieee.org
Neural network based speech enhancement (SE) has developed rapidly in the last decade.
Meanwhile, the self-supervised pre-trained model and vector quantization (VQ) have …