An analysis of the data efficiency in Tacotron2 speech synthesis system

R Mahum, A Irtaza, A Javed - IEEE Access, 2023 - ieeexplore.ieee.org

Various audio deep fake synthesis algorithms exist, such as deep voice, tacotron,
fastspeech, and imitation techniques. Despite the existence of various spoofing speech …

被引用次数：9 相关文章所有 2 个版本

[PDF] mtak.hu

Towards implementing a software tester for benchmarking MAP-T devices

A Al-hamadani, G Lencse - Infocommunications Journal, 2022 - real.mtak.hu

Several IPv6 transition technologies have been designed and developed over the past few
years to accelerate the full adoption of the IPv6 address pool. To make things more …

被引用次数：7 相关文章所有 7 个版本

[PDF] arxiv.org

Visualising model training via vowel space for text-to-speech systems

B Abeysinghe, J James, CI Watson… - arXiv preprint arXiv …, 2022 - arxiv.org

With the recent developments in speech synthesis via machine learning, this study explores
incorporating linguistics knowledge to visualise and evaluate synthetic speech model …

被引用次数：5 相关文章所有 6 个版本

[PDF] mtak.hu

Speaker adaptation experiments with limited data for end-to-end text-to-speech synthesis using tacotron2

AR Mandeel, MS Al-Radhi, TG Csapó - Infocommunications journal, 2022 - real.mtak.hu

Speech synthesis has the aim of generating humanlike speech from text. Nowadays, with
end-to-end systems, highly natural synthesized speech can be achieved if a large enough …

被引用次数：6 相关文章所有 6 个版本

[PDF] aclanthology.org

Building open-source speech technology for low-resource minority languages with sámi as an example–tools, methods and experiments

K Hiovain-Asikainen, S Moshagen - … of the 1st Annual Meeting of …, 2022 - aclanthology.org

This paper presents a work-in-progress report of an open-source speech technology project
for indigenous Sami languages. A less detailed description of this work has been presented …

被引用次数：3 相关文章所有 3 个版本

[PDF] githubusercontent.com

[PDF][PDF] Developing TTS and ASR for Lule and North Sámi languages

K Hiovain-Asikainen… - Proceedings of the …, 2023 - raw.githubusercontent.com

Recent innovations in speech technology have made high quality TTS and ASR available
even for extremely low-resource languages. This paper presents our updated work-in …

被引用次数：2 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] Exploring the limits of neural voice cloning: A case study on two well-known personalities

A González-Docasal, A Álvarez… - Proceedings of the …, 2022 - isca-archive.org

This work describes one successful and one failed Voice Cloning processes of two famous
personalities in order to be broadcast in a high-impact podcast and in a Spanish public …

被引用次数：2 相关文章所有 3 个版本

[PDF] isca-students.org

[PDF][PDF] The Future of Speaker Adaptation: Advancements in Text-to-Speech Synthesis Solutions

AR Mandeel - isca-students.org

Personalizing a text-to-speech (TTS) model is an admiringly advantageous application. The
TTS model can create a speech for any target speaker using a limited dataset. However …

高级搜索

QQ 群