Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors

A Firc, K Malinka, P Hanáček - Heliyon, 2023 - cell.com
Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and very difficult to differentiate between what is …

Speaker perception

SR Schweinberger, H Kawahara… - Wiley …, 2014 - Wiley Online Library
While humans use their voice mainly for communicating information about the world,
paralinguistic cues in the voice signal convey rich dynamic information about a speaker's …

How do you say 'Hello'? Personality impressions from brief novel voices

P McAleer, A Todorov, P Belin - PloS one, 2014 - journals.plos.org
On hearing a novel voice, listeners readily form personality impressions of that speaker.
Accurate or not, these impressions are known to affect subsequent interactions; yet the …

STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds

H Kawahara - Acoustical science and technology, 2006 - jstage.jst.go.jp
STRAIGHT, a speech analysis, modification synthesis system, is an extension of the
classical channel VOCODER that exploits the advantages of progress in information …

The development of emotion recognition from facial expressions and non‐linguistic vocalizations during childhood

G Chronaki, JA Hadwin, M Garner… - British Journal of …, 2015 - Wiley Online Library
Sensitivity to facial and vocal emotion is fundamental to children's social competence.
Previous research has focused on children's facial emotion recognition, and few studies …

The processing and perception of size information in speech sounds

DRR Smith, RD Patterson, R Turner… - The Journal of the …, 2005 - pubs.aip.org
There is information in speech sounds about the length of the vocal tract; specifically, as a
child grows, the resonators in the vocal tract grow and the formant frequencies of the vowels …

Vocal attractiveness increases by averaging

L Bruckert, P Bestelmeyer, M Latinus, J Rouger… - Current biology, 2010 - cell.com
Vocal attractiveness has a profound influence on listeners—a bias known as the" what
sounds beautiful is good" vocal attractiveness stereotype [1]—with tangible impact on a …

An investigation of multi-speaker training for WaveNet vocoder

T Hayashi, A Tamamori, K Kobayashi… - 2017 IEEE Automatic …, 2017 - ieeexplore.ieee.org
In this paper, we investigate the effectiveness of multi-speaker training for WaveNet vocoder.
In our previous work, we have demonstrated that our proposed speaker-dependent (SD) …

[HTML][HTML] Norm-based coding of voice identity in human auditory cortex

M Latinus, P McAleer, PEG Bestelmeyer, P Belin - Current Biology, 2013 - cell.com
Listeners exploit small interindividual variations around a generic acoustical structure to
discriminate and identify individuals from their voice—a key requirement for social …

Beyond correlation: acoustic transformation methods for the experimental study of emotional voice and speech

P Arias, L Rachman, M Liuni… - Emotion Review, 2021 - journals.sagepub.com
While acoustic analysis methods have become a commodity in voice emotion research,
experiments that attempt not only to describe but to computationally manipulate expressive …