EDL-Det: A Robust TTS Synthesis Detector Using VGG19-Based YAMNet and Ensemble Learning Block

R Mahum, A Irtaza, A Javed - IEEE Access, 2023 - ieeexplore.ieee.org
Various audio deep fake synthesis algorithms exist, such as deep voice, tacotron,
fastspeech, and imitation techniques. Despite the existence of various spoofing speech …

Towards implementing a software tester for benchmarking MAP-T devices

A Al-hamadani, G Lencse - Infocommunications Journal, 2022 - real.mtak.hu
Several IPv6 transition technologies have been designed and developed over the past few
years to accelerate the full adoption of the IPv6 address pool. To make things more …

Visualising model training via vowel space for text-to-speech systems

B Abeysinghe, J James, CI Watson… - arXiv preprint arXiv …, 2022 - arxiv.org
With the recent developments in speech synthesis via machine learning, this study explores
incorporating linguistics knowledge to visualise and evaluate synthetic speech model …

Speaker adaptation experiments with limited data for end-to-end text-to-speech synthesis using tacotron2

AR Mandeel, MS Al-Radhi, TG Csapó - Infocommunications journal, 2022 - real.mtak.hu
Speech synthesis has the aim of generating humanlike speech from text. Nowadays, with
end-to-end systems, highly natural synthesized speech can be achieved if a large enough …

Building open-source speech technology for low-resource minority languages with sámi as an example–tools, methods and experiments

K Hiovain-Asikainen, S Moshagen - … of the 1st Annual Meeting of …, 2022 - aclanthology.org
This paper presents a work-in-progress report of an open-source speech technology project
for indigenous Sami languages. A less detailed description of this work has been presented …

[PDF][PDF] Developing TTS and ASR for Lule and North Sámi languages

K Hiovain-Asikainen… - Proceedings of the …, 2023 - raw.githubusercontent.com
Recent innovations in speech technology have made high quality TTS and ASR available
even for extremely low-resource languages. This paper presents our updated work-in …

[PDF][PDF] Exploring the limits of neural voice cloning: A case study on two well-known personalities

A González-Docasal, A Álvarez… - Proceedings of the …, 2022 - isca-archive.org
This work describes one successful and one failed Voice Cloning processes of two famous
personalities in order to be broadcast in a high-impact podcast and in a Spanish public …

[PDF][PDF] The Future of Speaker Adaptation: Advancements in Text-to-Speech Synthesis Solutions

AR Mandeel - isca-students.org
Personalizing a text-to-speech (TTS) model is an admiringly advantageous application. The
TTS model can create a speech for any target speaker using a limited dataset. However …