[PDF][PDF] Sequence-to-sequence voice conversion using F0 and time conditioning and adversarial learning

F Bous, L Benaroya, N Obin, A Roebel - CoRR, 2021 - academia.edu
This paper presents a sequence-to-sequence voice conversion (S2S-VC) algorithm which
allows to preserve some aspects of the source speaker during conversion, typically its …

[图书][B] Empowering Security and Privacy-Preserving Interactions for Smart Device Users

J Li - 2023 - search.proquest.com
Emerging smart devices, such as smart home and augmented/virtual reality systems, are
reforming our living experience by automating our daily routines and interacting with us …

Enhancing Transmission of Voice in Real-Time Applications

P Venkatesh Kumar, P Nitish Kumar… - … Research Journal on …, 2023 - rspsciencehub.com
In today's telecommunication world sharing the data becomes very easy. It is a bit-
complicated in converting the text documents to voice assistance even proposed a lot of …

Análisis de seguridad y privacidad de Asistentes Personales con voces reales y voces sintéticas

C Palacios Castrillo, R Palacios… - … de Investigación en …, 2024 - idus.us.es
En este artículo se muestra el comportamiento de varios asistentes personales (Smart
Personal Assistants, SPAs) en diversos aspectos relativos a la seguridad ya la privacidad …

[PDF][PDF] Automated techniques for creating speech corpora from public data sources for ML training

L Drabeck, B Ramanan, T Woo… - International Journal of …, 2020 - pdfs.semanticscholar.org
For machine learning (ML) to work well, there is a need for large amounts of good quality
training data. Obtaining such data is often the key bottleneck for the entire ML development …

[PDF][PDF] Lexical pitch accent and duration modeling for neural end-to-end text-to-speech synthesis.

Y Yasuda - 2021 - ir.soken.ac.jp
Text-to-speech synthesis (TTS) is a task to transform texts into speech. End-to-end TTS is
one of the TTS frameworks, and its unique approach is characterized by using only a single …

[PDF][PDF] Can DeepFake voices steal high-profile identities?

BMHF Kelly - oxfordwaveresearch.com
Computer-generated synthetic voices are increasingly growing indistinguishable from
human voices. While these high-quality synthetic voices open new horizons for the …

Der Forschungsstand zu Deepfakes und deren Erstellung

Y Wegmann - 2021 - digitalcollection.zhaw.ch
Fortschritte im Bereich der künstlichen Intelligenz und neuronalen Netzwerken haben zur
Generierung von realistischen gefälschten Inhalten geführt. Diese neue Technologie mit …

The Security Threat of Adversarial Samples to Deep Learning Networks

B Wang, Y Zhang, M Zhu, Y Chen - … Conference on Intelligent …, 2020 - ieeexplore.ieee.org
With the prosperity of artificial intelligence, research on machine learning becomes a hot
issue globally. Generative Adversarial Networks expose the huge security risks of machine …

Detecting deep-fake audio through vocal tract reconstruction

PG Traynor, K Butler, LE Blue, L Vargas… - US Patent …, 2023 - Google Patents
A method is provided for identifying synthetic “deep-fake” audio samples versus organic
audio samples. Methods may include: generating a model of a vocal tract using one or more …