ManaTTS Persian: a recipe for creating TTS datasets for lower resource languages

MF Qharabagh, Z Dehghanian, HR Rabiee - arXiv preprint arXiv …, 2024 - arxiv.org
In this study, we introduce ManaTTS, the most extensive publicly accessible single-speaker
Persian corpus, and a comprehensive framework for collecting transcribed speech datasets …

GE2PE: Persian End-to-End Grapheme-to-Phoneme Conversion

E Rahmati, H Sameti - Findings of the Association for …, 2024 - aclanthology.org
Abstract Text-to-Speech (TTS) systems have made significant strides, enabling the
generation of speech from grapheme sequences. However, for low-resource languages …