Traditional parametric coding of speech facilitates low rate but provides poor reconstruction quality because of the inadequacy of the model used. We describe how a WaveNet …
KK Paliwal, BS Atal - IEEE transactions on speech and audio …, 1993 - ieeexplore.ieee.org
For low bit rate speech coding applications, it is important to quantize the LPC parameters accurately using as few bits as possible. Though vector quantizers are more efficient than …
A Spanias, T Painter, V Atti - 2006 - books.google.com
An in-depth treatment of algorithms and standards for perceptual coding of high-fidelity audio, this self-contained reference surveys and addresses all aspects of the field. Coverage …
A Gersho - Proceedings of the IEEE, 1994 - ieeexplore.ieee.org
Speech and audio compression has advanced rapidly in recent years spurred on by cost- effective digital technology and diverse commercial applications. Recent activity in speech …
P Kroon, E Deprettere, R Sluyter - IEEE transactions on …, 1986 - ieeexplore.ieee.org
This paper describes an effective and efficient time domain speech encoding technique that has an appealing low complexity, and produces toll quality speech at rates below 16 kbits/s …
KK Paliwal, BS Atal - Speech and Audio Coding for Wireless and Network …, 1993 - Springer
Linear predictive coding (LPC) parameters are widely used in various speech coding applications for representing the short-time spectral envelope information of speech [1]. For …
A Practical Handbook of Speech Coders offers in-depth treatment of the basics of speech coding plus the innovations to the basic methods that make the coders useful and efficient. It …
RP Ramachandran, P Kabal - IEEE Transactions on Acoustics …, 1989 - ieeexplore.ieee.org
Prediction error filters which combine short-time prediction (formant prediction) with long- time prediction (pitch prediction) in a cascade connection are examined. A number of …
WB Kleijn - IEEE transactions on speech and audio processing, 1993 - ieeexplore.ieee.org
Voiced speech is interpreted as a concentration of slowly evolving pitch-cycle waveforms. This signal can be reconstructed by interpolation from a downsampled sequence of pitch …