Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - speech communication, 2009 - Elsevier
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

Statistical parametric speech synthesis using deep neural networks

H Zen, A Senior, M Schuster - 2013 ieee international …, 2013 - ieeexplore.ieee.org
Conventional approaches to statistical parametric speech synthesis typically use decision
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …

A deep learning approach for generalized speech animation

S Taylor, T Kim, Y Yue, M Mahler, J Krahe… - ACM Transactions On …, 2017 - dl.acm.org
We introduce a simple and effective deep learning approach to automatically generate
natural looking speech animation that synchronizes to input speech. Our approach uses a …

Speech parameter generation algorithms for HMM-based speech synthesis

K Tokuda, T Yoshimura, T Masuko… - … on acoustics, speech …, 2000 - ieeexplore.ieee.org
This paper derives a speech parameter generation algorithm for HMM-based speech
synthesis, in which the speech parameter sequence is generated from HMMs whose …

[PDF][PDF] Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis

T Yoshimura, K Tokuda, T Masuko… - Sixth European …, 1999 - isca-archive.org
In this paper, we describe an HMM-based speech synthesis system in which spectrum, pitch
and state duration are modeled simultaneously in a unified framework of HMM. In the …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

Two-pass decoding for speech recognition of search and action requests

E Weinstein, A Waters - US Patent 8,645,138, 2014 - Google Patents
Disclosed are apparatus and methods for processing spoken speech. Input speech can be
received at a computing system. During a first pass of speech recognition, a plurality of …

Multi-space probability distribution HMM

K Tokuda, T Masuko, N Miyazaki… - … on Information and …, 2002 - search.ieice.org
This paper proposes a new kind of hidden Markov model (HMM) based on multi-space
probability distribution, and derives a parameter estimation algorithm for the extended HMM …

A hidden semi-Markov model-based speech synthesis system

H Zen, K Tokuda, T Masuko, T Kobayasih… - IEICE transactions on …, 2007 - search.ieice.org
A statistical speech synthesis system based on the hidden Markov model (HMM) was
recently proposed. In this system, spectrum, excitation, and duration of speech are modeled …