During text-to-speech processing, audio data corresponding to a word part, word, or group of words is generated using a trained model and used by a unit selection engine to create …
(57) ABSTRACT A method of providing real-time speech synthesis based on user input includes presenting a graphical user interface having a low-dimensional representation of a …
M Tamura, M Morita - US Patent 11,423,874, 2022 - Google Patents
A speech synthesis model training device includes one or more hardware processors configured to perform the following. Storing, in a speech corpus storing unit, speech data …