High-quality speech coding with Sample RNN

J Klejsa, P Hedelin, C Zhou, R Fejgin… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
We provide a speech coding scheme employing a generative model based on SampleRNN
that, while operating at significantly lower bitrates, matches or surpasses the perceptual …

NESC: Robust neural end-2-end speech coding with GANs

N Pia, K Gupta, S Korse, M Multrus, G Fuchs - arXiv preprint arXiv …, 2022 - arxiv.org
Neural networks have proven to be a formidable tool to tackle the problem of speech coding
at very low bit rates. However, the design of a neural coder that can be operated robustly …

Cascaded cross-module residual learning towards lightweight end-to-end speech coding

K Zhen, J Sung, MS Lee, S Beack, M Kim - arXiv preprint arXiv …, 2019 - arxiv.org
Speech codecs learn compact representations of speech signals to facilitate data
transmission. Many recent deep neural network (DNN) based end-to-end speech codecs …

CQNV: A combination of coarsely quantized bitstream and neural vocoder for low rate speech coding

Y Zheng, L Xiao, W Tu, Y Yang, X Xu - arXiv preprint arXiv:2307.13295, 2023 - arxiv.org
Recently, speech codecs based on neural networks have proven to perform better than
traditional methods. However, redundancy in traditional parameter quantization is visible …

Low bit-rate speech coding with VQ-VAE and a WaveNet decoder

C Gârbacea, A van den Oord, Y Li… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
In order to efficiently transmit and store speech signals, speech codecs create a minimally
redundant representation of the input signal which is then decoded at the receiver with the …

Generative speech coding with predictive variance regularization

WB Kleijn, A Storus, M Chinen, T Denton… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
The recent emergence of machine-learning based generative models for speech suggests a
significant reduction in bit rate for speech codecs is possible. However, the performance of …

FunCodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec

Z Du, S Zhang, K Hu, S Zheng - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
This paper presents FunCodec, a fundamental neural speech codec toolkit, which is an
extension of the open-source speech processing toolkit FunASR. FunCodec provides …

Source-aware neural speech coding for noisy speech compression

H Yang, K Zhen, S Beack, M Kim - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
This paper introduces a novel neural network-based speech coding system that can process
noisy speech effectively. The proposed source-aware neural audio coding (SANAC) system …

Enhancing into the codec: Noise robust speech coding with vector-quantized autoencoders

J Casebeer, V Vale, U Isik, JM Valin… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Audio codecs based on discretized neural autoencoders have recently been developed and
shown to provide significantly higher compression levels for comparable quality speech out …

A streamwise GAN vocoder for wideband speech coding at very low bit rate

A Mustafa, J Büthe, S Korse, K Gupta… - … IEEE Workshop on …, 2021 - ieeexplore.ieee.org
Recently, GAN vocoders have seen rapid progress in speech synthesis, starting to
outperform autoregressive models in perceptual quality with much higher generation speed …