Nesc: Robust neural end-2-end speech coding with gans

N Pia, K Gupta, S Korse, M Multrus, G Fuchs - arXiv preprint arXiv …, 2022 - arxiv.org
Neural networks have proven to be a formidable tool to tackle the problem of speech coding
at very low bit rates. However, the design of a neural coder that can be operated robustly …

Scalable and efficient neural speech coding: A hybrid design

K Zhen, J Sung, MS Lee, S Beack… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
We present a scalable and efficient neural waveform coding system for speech
compression. We formulate the speech coding problem as an autoencoding task, where a …

A streamwise gan vocoder for wideband speech coding at very low bit rate

A Mustafa, J Büthe, S Korse, K Gupta… - … IEEE Workshop on …, 2021 - ieeexplore.ieee.org
Recently, GAN vocoders have seen rapid progress in speech synthesis, starting to
outperform autoregressive models in perceptual quality with much higher generation speed …

Cascaded cross-module residual learning towards lightweight end-to-end speech coding

K Zhen, J Sung, MS Lee, S Beack, M Kim - arXiv preprint arXiv …, 2019 - arxiv.org
Speech codecs learn compact representations of speech signals to facilitate data
transmission. Many recent deep neural network (DNN) based end-to-end speech codecs …

An intra-BRNN and GB-RVQ based end-to-end neural audio codec

L Xu, J Jiang, D Zhang, X Xia, L Chen, Y Xiao… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, neural networks have proven to be effective in performing speech coding task at
low bitrates. However, under-utilization of intra-frame correlations and the error of quantizer …

Ultra-low-bitrate speech coding with pretrained transformers

A Siahkoohi, M Chinen, T Denton, WB Kleijn… - arXiv preprint arXiv …, 2022 - arxiv.org
Speech coding facilitates the transmission of speech over low-bandwidth networks with
minimal distortion. Neural-network based speech codecs have recently demonstrated …

PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders

Y Pan, L Ma, J Zhao - arXiv preprint arXiv:2404.02702, 2024 - arxiv.org
Neural speech codec has recently gained widespread attention in generative speech
modeling domains, like voice conversion, text-to-speech synthesis, etc. However, ensuring …

Latent-domain predictive neural speech coding

X Jiang, X Peng, H Xue, Y Zhang… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
Neural audio/speech coding has recently demonstrated its capability to deliver high quality
at much lower bitrates than traditional methods. However, existing neural audio/speech …

High-quality speech coding with sample RNN

J Klejsa, P Hedelin, C Zhou, R Fejgin… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
We provide a speech coding scheme employing a generative model based on SampleRNN
that, while operating at significantly lower bitrates, matches or surpasses the perceptual …

Source-aware neural speech coding for noisy speech compression

H Yang, K Zhen, S Beack, M Kim - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
This paper introduces a novel neural network-based speech coding system that can process
noisy speech effectively. The proposed source-aware neural audio coding (SANAC) system …