Personalized neural speech codec

I Jang, H Yang, W Lim, S Beack… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
In this paper, we propose a personalized neural speech codec, envisioning that
personalization can reduce the model complexity or improve perceptual speech quality …

Efficient personalized speech enhancement through self-supervised learning

A Sivaraman, M Kim - IEEE Journal of Selected Topics in Signal …, 2022 - ieeexplore.ieee.org
This work presents self-supervised learning methods for monaural speaker-specific (ie,
personalized) speech enhancement models. While general-purpose models must broadly …

Test-time adaptation toward personalized speech enhancement: Zero-shot learning with knowledge distillation

S Kim, M Kim - 2021 IEEE Workshop on Applications of Signal …, 2021 - ieeexplore.ieee.org
In realistic speech enhancement settings for end-user devices, we often encounter only a
few speakers and noise types that tend to reoccur in the specific acoustic environment. We …

Triple-0: Zero-shot denoising and dereverberation on an end-to-end frozen anechoic speech separation network

S Gul, MS Khan, A Ur-Rehman - Plos one, 2024 - journals.plos.org
Speech enhancement is crucial both for human and machine listening applications. Over the
last decade, the use of deep learning for speech enhancement has resulted in tremendous …

Personalized Speech Enhancement Without a Separate Speaker Embedding Model

T Pärnamaa, A Saabas - arXiv preprint arXiv:2406.09928, 2024 - arxiv.org
Personalized speech enhancement (PSE) models can improve the audio quality of
teleconferencing systems by adapting to the characteristics of a speaker's voice. However …

The Potential of Neural Speech Synthesis-Based Data Augmentation for Personalized Speech Enhancement

A Kuznetsova, A Sivaraman… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
With the advances in deep learning, speech enhancement systems benefited from large
neural network architectures and achieved state-of-the-art quality. However, speaker …

OSSEM: one-shot speaker adaptive speech enhancement using meta learning

C Yu, SW Fu, TA Hsieh, Y Tsao, M Ravanelli - arXiv preprint arXiv …, 2021 - arxiv.org
Although deep learning (DL) has achieved notable progress in speech enhancement (SE),
further research is still required for a DL-based SE system to adapt effectively and efficiently …

Resource-Efficient Model Adaptation Methods for Personalized Speech Enhancement Systems

A Sivaraman - 2024 - search.proquest.com
This dissertation introduces several machine learning algorithms for developing
personalized speech enhancement (PSE) systems. In particular, we investigate the data …

Zero-shot test time adaptation via knowledge distillation for personalized speech denoising and dereverberation

S Kim, M Athi, G Shi, M Kim… - The Journal of the …, 2024 - pubs.aip.org
A personalization framework to adapt compact models to test time environments and
improve their speech enhancement (SE) performance in noisy and reverberant conditions is …

Learning when to trust which teacher for weakly supervised ASR

A Agrawal, M Rao, AK Sahu, G Chennupati… - arXiv preprint arXiv …, 2023 - arxiv.org
Automatic speech recognition (ASR) training can utilize multiple experts as teacher models,
each trained on a specific domain or accent. Teacher models may be opaque in nature …