Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's...

[PDF][PDF] Sequence-to-sequence voice conversion using F0 and time conditioning and adversarial learning

F Bous, L Benaroya, N Obin, A Roebel - CoRR, 2021 - academia.edu

This paper presents a sequence-to-sequence voice conversion (S2S-VC) algorithm which
allows to preserve some aspects of the source speaker during conversion, typically its …

被引用次数：1 相关文章所有 3 个版本

[图书][B] Empowering Security and Privacy-Preserving Interactions for Smart Device Users

J Li - 2023 - search.proquest.com

Emerging smart devices, such as smart home and augmented/virtual reality systems, are
reforming our living experience by automating our daily routines and interacting with us …

[PDF] rspsciencehub.com

Enhancing Transmission of Voice in Real-Time Applications

P Venkatesh Kumar, P Nitish Kumar… - … Research Journal on …, 2023 - rspsciencehub.com

In today's telecommunication world sharing the data becomes very easy. It is a bit-
complicated in converting the text documents to voice assistance even proposed a lot of …

[PDF] us.es

Análisis de seguridad y privacidad de Asistentes Personales con voces reales y voces sintéticas

C Palacios Castrillo, R Palacios… - … de Investigación en …, 2024 - idus.us.es

En este artículo se muestra el comportamiento de varios asistentes personales (Smart
Personal Assistants, SPAs) en diversos aspectos relativos a la seguridad ya la privacidad …

[PDF] semanticscholar.org

[PDF][PDF] Automated techniques for creating speech corpora from public data sources for ML training

L Drabeck, B Ramanan, T Woo… - International Journal of …, 2020 - pdfs.semanticscholar.org

For machine learning (ML) to work well, there is a need for large amounts of good quality
training data. Obtaining such data is often the key bottleneck for the entire ML development …

被引用次数：1 相关文章所有 2 个版本

[PDF] soken.ac.jp

[PDF][PDF] Lexical pitch accent and duration modeling for neural end-to-end text-to-speech synthesis.

Y Yasuda - 2021 - ir.soken.ac.jp

Text-to-speech synthesis (TTS) is a task to transform texts into speech. End-to-end TTS is
one of the TTS frameworks, and its unique approach is characterized by using only a single …

[PDF][PDF] Can DeepFake voices steal high-profile identities?

BMHF Kelly - oxfordwaveresearch.com

Computer-generated synthetic voices are increasingly growing indistinguishable from
human voices. While these high-quality synthetic voices open new horizons for the …

Der Forschungsstand zu Deepfakes und deren Erstellung

Y Wegmann - 2021 - digitalcollection.zhaw.ch

Fortschritte im Bereich der künstlichen Intelligenz und neuronalen Netzwerken haben zur
Generierung von realistischen gefälschten Inhalten geführt. Diese neue Technologie mit …

The Security Threat of Adversarial Samples to Deep Learning Networks

B Wang, Y Zhang, M Zhu, Y Chen - … Conference on Intelligent …, 2020 - ieeexplore.ieee.org

With the prosperity of artificial intelligence, research on machine learning becomes a hot
issue globally. Generative Adversarial Networks expose the huge security risks of machine …

被引用次数：1 相关文章所有 2 个版本

[PDF] googleapis.com

Detecting deep-fake audio through vocal tract reconstruction

PG Traynor, K Butler, LE Blue, L Vargas… - US Patent …, 2023 - Google Patents

A method is provided for identifying synthetic “deep-fake” audio samples versus organic
audio samples. Methods may include: generating a model of a vocal tract using one or more …

高级搜索

QQ 群