Scalable and efficient neural speech coding: A hybrid design

K Zhen, J Sung, MS Lee, S Beack… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
We present a scalable and efficient neural waveform coding system for speech
compression. We formulate the speech coding problem as an autoencoding task, where a …

Efficient and scalable neural residual waveform coding with collaborative quantization

K Zhen, MS Lee, J Sung, S Beack… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Scalability and efficiency are desired in neural speech codecs, which support a wide range
of bitrates for applications on various devices. We propose a collaborative quantization (CQ) …

Latent-domain predictive neural speech coding

X Jiang, X Peng, H Xue, Y Zhang… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
Neural audio/speech coding has recently demonstrated its capability to deliver high quality
at much lower bitrates than traditional methods. However, existing neural audio/speech …

Source-aware neural speech coding for noisy speech compression

H Yang, K Zhen, S Beack, M Kim - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
This paper introduces a novel neural network-based speech coding system that can process
noisy speech effectively. The proposed source-aware neural audio coding (SANAC) system …

NESC: Robust neural end-2-end speech coding with GANs

N Pia, K Gupta, S Korse, M Multrus, G Fuchs - arXiv preprint arXiv …, 2022 - arxiv.org
Neural networks have proven to be a formidable tool to tackle the problem of speech coding
at very low bit rates. However, the design of a neural coder that can be operated robustly …

PromptCodec: High-Fidelity Neural Speech Codec using Disentangled Representation Learning based Adaptive Feature-aware Prompt Encoders

Y Pan, L Ma, J Zhao - arXiv preprint arXiv:2404.02702, 2024 - arxiv.org
Neural speech codec has recently gained widespread attention in generative speech
modeling domains, like voice conversion, text-to-speech synthesis, etc. However, ensuring …

Ultra-low-bitrate speech coding with pretrained transformers

A Siahkoohi, M Chinen, T Denton, WB Kleijn… - arXiv preprint arXiv …, 2022 - arxiv.org
Speech coding facilitates the transmission of speech over low-bandwidth networks with
minimal distortion. Neural-network based speech codecs have recently demonstrated …

CQNV: A combination of coarsely quantized bitstream and neural vocoder for low rate speech coding

Y Zheng, L Xiao, W Tu, Y Yang, X Xu - arXiv preprint arXiv:2307.13295, 2023 - arxiv.org
Recently, speech codecs based on neural networks have proven to perform better than
traditional methods. However, redundancy in traditional parameter quantization is visible …

An intra-BRNN and GB-RVQ based end-to-end neural audio codec

L Xu, J Jiang, D Zhang, X Xia, L Chen, Y Xiao… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, neural networks have proven to be effective in performing speech coding tasks at
low bitrates. However, under-utilization of intra-frame correlations and the error of quantizer …

Variational speech waveform compression to catalyze semantic communications

S Yao, Z Xiao, S Wang, J Dai, K Niu… - 2023 IEEE Wireless …, 2023 - ieeexplore.ieee.org
We propose a novel neural waveform compression method to catalyze emerging speech
semantic communications. By introducing nonlinear transform and variational modeling, we …