Can we use speaker recognition technology to attack itself? Enhancing mimicry attacks using automatic target speaker selection

T Kinnunen, RG Hautamäki, V Vestman… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
We consider technology-assisted mimicry attacks in the context of automatic speaker
verification (ASV). We use ASV itself to select targeted speakers to be attacked by human …

Black-box attacks on automatic speaker verification using feedback-controlled voice conversion

X Tian, RK Das, H Li - arXiv preprint arXiv:1909.07655, 2019 - arxiv.org
Automatic speaker verification (ASV) systems in practice are highly vulnerable to spoofing
attacks. The latest voice conversion technologies are able to produce perceptually natural …

Eliminating data collection bottleneck for wake word engine training using found and synthetic data

B Ramanan, L Drabeck, T Woo… - … Conference on Big …, 2019 - ieeexplore.ieee.org
Voice interfaces are fast becoming an important human-machine interface, and Wake Word
Engines (WWEs) are a critical part of modern voice interfaces. There are recent …

Voice spoofing detection with raw waveform based on Dual Path Res2net

X Fang, H Du, T Gao, L Zou, Z Ling - 5th International Conference on …, 2021 - dl.acm.org
The natural-sounding speech produced by recent text-to-speech and voice conversion
techniques poses serious threats to automatic speaker verification systems. The majority of …

Voice impersonation for Thai speech using CycleGAN over prosody

C Chuangulueam, B Kijsirikul… - Proceedings of the 4th …, 2022 - dl.acm.org
Voice impersonation can be a challenging task for mimicking all aspects of the target
speaker. This paper proposes a prosody conversion using a cycle-consistent adversarial …

Voice transformation using two-level dynamic warping and neural networks

AW Al-Dulaimi, TK Moon, JH Gunther - Signals, 2021 - mdpi.com
Voice transformation, for example, from a male speaker to a female speaker, is achieved
here using a two-level dynamic warping algorithm in conjunction with an artificial neural …

Machine learning for limited data voice conversion

B Sisman - 2019 - search.proquest.com
Voice Conversion aims to convert one's voice to sound like that of another. This thesis is
focused on developing advanced machine learning algorithms and frameworks for voice …

[BOOK][B] AI-Synthesized Speech: Generation and Detection

EAA Abdrabuh - 2022 - search.proquest.com
From speech to images, and videos, advances in machine learning have led to dramatic
improvements in the quality and realism of so-called AI-synthesized content. While there are …

Which gesture generator performs better?

U Zabala, I Rodriguez… - … on Robotics and …, 2021 - ieeexplore.ieee.org
Talking gestures are a fundamental part of body language and, therefore, are also important
for social robots. Gesture generation by generative approaches is supposed to produce a …

[BOOK][B] Identifying Different Audio Sources Through Fluid Dynamic and Acoustic Evaluation of Their Source Mechanics

L Blue - 2023 - search.proquest.com
From cave paintings to internet blogs, technology has always fundamentally shaped the ways
in which humanity has communicated, bringing with it new possibilities and challenges. The …