Audiovisual speech synthesis: An overview of the state-of-the-art

W Mattheyses, W Verhelst - Speech Communication, 2015 - Elsevier
We live in a world where there are countless interactions with computer systems in every-
day situations. In the most ideal case, this interaction feels as familiar and as natural as the …

Visual prosody and speech intelligibility: Head movement improves auditory speech perception

KG Munhall, JA Jones, DE Callan… - Psychological …, 2004 - journals.sagepub.com
People naturally move their heads when they speak, and our study shows that this rhythmic
head motion conveys linguistic information. Three-dimensional head and face motion and …

[HTML][HTML] Quantitative association of vocal-tract and facial behavior

H Yehia, P Rubin, E Vatikiotis-Bateson - Speech Communication, 1998 - Elsevier
This paper examines the degrees of correlation among vocal-tract and facial movement data
and the speech acoustics. Multilinear techniques are applied to support the claims that facial …

[PDF][PDF] Audio visual speech recognition

C Neti, G Potamianos, J Luettin, I Matthews, H Glotin… - 2000 - infoscience.epfl.ch
We have made significant progress in automatic speech recognition ASR for well-defined
applications like dictation and medium vocabulary transaction processing tasks in relatively …

Linking facial animation, head motion and speech acoustics

HC Yehia, T Kuratate, E Vatikiotis-Bateson - Journal of phonetics, 2002 - Elsevier
Facial motion during speech is a direct consequence of vocal-tract motion which also
shapes the acoustics of speech. This fact suggests that speech acoustics can be used to …

[图书][B] Handbook of virtual humans

N Magnenat-Thalmann, D Thalmann - 2005 - books.google.com
Virtual Humans are becoming more and more popular and used in many applications such
as the entertainment industry (in both film and games) and medical applications. This …

Emotional expressions in audiovisual human computer interaction

LS Chen, TS Huang - … Proceedings. Latest Advances in the Fast …, 2000 - ieeexplore.ieee.org
Visual and auditory modalities are two of the most commonly used media in interactions
between humans. The authors describe a system to continuously monitor the user's voice …

Spatial and Temporal Constraints on Audiovisual Speech Perception.

KG Munhall, E Vatikiotis-Bateson - 2004 - psycnet.apa.org
In this chapter we address the audiovisual integration of information for speech. We first
summarize the spatial and temporal features of normal speech communication and the …

Talking heads-Models and applications for multimodal speech synthesis

J Beskow - 2003 - diva-portal.org
This thesis presents work in the area of computer animated talking heads. A system for
multimodal speech synthesis has been developed, capable of generating audiovisual …

[PDF][PDF] Audio-visual synthesis of talking faces from speech production correlates.

T Kuratate, KG Munhall, P Rubin, E Vatikiotis-Bateson… - Eurospeech, 1999 - Citeseer
This paper presents technical refinements and extensions of our system for correlating
audible and visible components of speech behavior and subsequently using those …