Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels

CH Wu, WB Liang - IEEE Transactions on Affective Computing, 2010 - ieeexplore.ieee.org
This work presents an approach to emotion recognition of affective speech based on
multiple classifiers using acoustic-prosodic information (AP) and semantic labels (SLs). For …

Using non-speech sounds during text-to-speech synthesis

KEA Silverman, M Neeracher - US Patent 8,027,837, 2011 - Google Patents
Abstract Systems, apparatus, methods and computer program products are described for
producing text-to-speech synthesis with non-speech sounds. In general, some of the pauses …

Multi-unit approach to text-to-speech synthesis

M Neeracher, DK Naik, KB Aitken… - US Patent …, 2011 - Google Patents
Methods, apparatus, systems, and computer program products are provided for synthesizing
speech. One method includes matching a first level of units of a received input string to …

Voice conversion using duration-embedded bi-HMMs for expressive speech synthesis

CH Wu, CC Hsia, TH Liu… - IEEE Transactions on …, 2006 - ieeexplore.ieee.org
This paper presents an expressive voice conversion model (DeBi-HMM) as the post
processing of a text-to-speech (TTS) system for expressive speech synthesis. DeBi-HMM is …

A data fusion method for maritime traffic surveillance: The fusion of AIS data and VHF speech information

Y Chen, X Qi, C Huang, J Zheng - Ocean Engineering, 2024 - Elsevier
The more comprehensive information supply for Vessel Traffic Service (VTS), the greater
capacity for maritime traffic surveillance. To supply ship intention information contained in …

Exploiting prosody hierarchy and dynamic features for pitch modeling and generation in HMM-based speech synthesis

CC Hsia, CH Wu, JY Wu - IEEE transactions on audio, speech …, 2010 - ieeexplore.ieee.org
This paper proposes a method for modeling and generating pitch in hidden Markov model
(HMM)-based Mandarin speech synthesis by exploiting prosody hierarchy and dynamic …

Interactive multimedia mirror system design

JR Ding, CL Huang, JK Lin, JF Yang… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org
This investigation describes a novel design and implementation of an interactive multimedia
mirror system, called" magic mirror". The magic mirror implemented in a personal computer …

Efficient and reliable perceptual weight tuning for unit-selection text-to-speech synthesis based on active interactive genetic algorithms: A proof-of-concept

F Alías, L Formiga, X Llorá - Speech Communication, 2011 - Elsevier
Unit-selection speech synthesis is one of the current corpus-based text-to-speech synthesis
techniques. The quality of the generated speech depends on the accuracy of the unit …

Improving HMM speech synthesis of interrogative sentences by pitch track transformations

P Nagy, G Németh - Speech Communication, 2016 - Elsevier
Modeling interrogative sentence prosody is a challenging task due to the significant
variation of questions. Prosody is produced by intonation, intensity and duration features …

Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion

CC Hsia, CH Wu, JQ Wu - IEEE transactions on computers, 2007 - ieeexplore.ieee.org
In emotional speech synthesis, a large speech database is required for high-quality speech
output. Voice conversion needs only a compact-sized speech database for each emotion …