Automatic generation of synthesis units and prosodic information for Chinese concatenative synthesis

Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels

CH Wu, WB Liang - IEEE Transactions on Affective Computing, 2010 - ieeexplore.ieee.org

This work presents an approach to emotion recognition of affective speech based on
multiple classifiers using acoustic-prosodic information (AP) and semantic labels (SLs). For …

被引用次数：357 相关文章所有 14 个版本

[PDF] googleapis.com

Using non-speech sounds during text-to-speech synthesis

KEA Silverman, M Neeracher - US Patent 8,027,837, 2011 - Google Patents

Abstract Systems, apparatus, methods and computer program products are described for
producing text-to-speech synthesis with non-speech sounds. In general, some of the pauses …

被引用次数：229 相关文章所有 4 个版本

[PDF] googleapis.com

Multi-unit approach to text-to-speech synthesis

M Neeracher, DK Naik, KB Aitken… - US Patent …, 2011 - Google Patents

Methods, apparatus, systems, and computer program products are provided for synthesizing
speech. One method includes matching a first level of units of a received input string to …

被引用次数：187 相关文章所有 4 个版本

[PDF] researchgate.net

Voice conversion using duration-embedded bi-HMMs for expressive speech synthesis

CH Wu, CC Hsia, TH Liu… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org

This paper presents an expressive voice conversion model (DeBi-HMM) as the post
processing of a text-to-speech (TTS) system for expressive speech synthesis. DeBi-HMM is …

被引用次数：131 相关文章所有 13 个版本

A data fusion method for maritime traffic surveillance: The fusion of AIS data and VHF speech information

Y Chen, X Qi, C Huang, J Zheng - Ocean Engineering, 2024 - Elsevier

The more comprehensive information supply for Vessel Traffic Service (VTS), the greater
capacity for maritime traffic surveillance. To supply ship intention information contained in …

Exploiting prosody hierarchy and dynamic features for pitch modeling and generation in HMM-based speech synthesis

CC Hsia, CH Wu, JY Wu - IEEE transactions on audio, speech …, 2010 - ieeexplore.ieee.org

This paper proposes a method for modeling and generating pitch in hidden Markov model
(HMM)-based Mandarin speech synthesis by exploiting prosody hierarchy and dynamic …

被引用次数：49 相关文章所有 9 个版本

Interactive multimedia mirror system design

JR Ding, CL Huang, JK Lin, JF Yang… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org

This investigation describes a novel design and implementation of an interactive multimedia
mirror system, called" magic mirror". The magic mirror implemented in a personal computer …

被引用次数：26 相关文章所有 6 个版本

[PDF] hal.science

Efficient and reliable perceptual weight tuning for unit-selection text-to-speech synthesis based on active interactive genetic algorithms: A proof-of-concept

F Alías, L Formiga, X Llorá - Speech Communication, 2011 - Elsevier

Unit-selection speech synthesis is one of the current corpus-based text-to-speech synthesis
techniques. The quality of the generated speech depends on the accuracy of the unit …

被引用次数：25 相关文章所有 9 个版本

[PDF] sciencedirect.com

Improving HMM speech synthesis of interrogative sentences by pitch track transformations

P Nagy, G Németh - Speech Communication, 2016 - Elsevier

Modeling interrogative sentence prosody is a challenging task due to the significant
variation of questions. Prosody is produced by intonation, intensity and duration features …

被引用次数：11 相关文章所有 4 个版本

Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion

CC Hsia, CH Wu, JQ Wu - IEEE transactions on computers, 2007 - ieeexplore.ieee.org

In emotional speech synthesis, a large speech database is required for high-quality speech
output. Voice conversion needs only a compact-sized speech database for each emotion …

被引用次数：28 相关文章所有 7 个版本

高级搜索

QQ 群