Bandwidth extension for hierarchical speech and audio coding in ITU-T Rec. G. 729.1

B Geiser, P Jax, P Vary, H Taddei… - … on Audio, Speech …, 2007 - ieeexplore.ieee.org
Recommendation G. 729.1 is a new ITU-T standard which was approved in May 2006. This
recommendation describes a hierarchical speech and audio coding algorithm built on top of …

Source and filter estimation for throat-microphone speech enhancement

MAT Turan, E Erzin - IEEE/ACM Transactions on Audio, Speech …, 2015 - ieeexplore.ieee.org
In this paper, we propose a new statistical enhancement system for throat microphone
recordings through source and filter separation. Throat microphones (TM) are skin-attached …

Evaluation of an artificial speech bandwidth extension method in three languages

H Pulakka, L Laaksonen, M Vainio… - IEEE transactions on …, 2008 - ieeexplore.ieee.org
Quality and intelligibility of narrowband telephone speech can be improved by artificial
bandwidth extension (ABE), which extends the speech bandwidth using only information …

Artificial bandwidth extension of spectral envelope along a Viterbi path

C Yağlı, MAT Turan, E Erzin - Speech communication, 2013 - Elsevier
In this paper, we propose a hidden Markov model (HMM)-based wideband spectral
envelope estimation method for the artificial bandwidth extension problem. The proposed …

Adaptive, scalable packet loss recovery

C Feldbauer, WB Kleijn - US Patent 8,325,622, 2012 - Google Patents
(57) ABSTRACT A system for transmitting data packets representing a source signal across
a packet data network is provided. The encoder comprises a first encoder (110) and a …

Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra

MAT Turan, E Erzin - 2013 IEEE International Conference on …, 2013 - ieeexplore.ieee.org
We investigate spectral envelope mapping problem with joint analysis of throat-and acoustic-
microphone recordings to enhance throatmicrophone speech. A new phone-dependent …

Conditional vector quantization for voice conversion

A Mouchtaris, Y Agiomyrgiannakis… - … on Acoustics, Speech …, 2007 - ieeexplore.ieee.org
Voice conversion methods have the objective of transforming speech spoken by a particular
source speaker, so that it sounds as if spoken by a different target speaker. The majority of …

Enhancement of throat microphone recordings using gaussian mixture model probabilistic estimator

MAT Turan - arXiv preprint arXiv:1804.05937, 2018 - arxiv.org
The throat microphone is a body-attached transducer that is worn against the neck. It
captures the signals that are transmitted through the vocal folds, along with the buzz tone of …

Deep Conditional Measure Quantization

G Turinici - … Conference on Optimization, Learning Algorithms and …, 2023 - Springer
Quantization of a probability measure means representing it with a finite set of Dirac masses
that approximates the input distribution well enough (in some metric space of probability …

Children's Speech Recognition Under Mismatched Condition: A Review

Y Sunil, SRM Prasanna, R Sinha - IETE Journal of Education, 2016 - Taylor & Francis
Automatic speech recognition (ASR) is a task of converting speech to text. In this article, the
ASR system trained using the speech of adult speakers and tested using the speech of child …