CHiVE: Varying prosody in speech synthesis with a linguistically driven dynamic hierarchical conditional variational network T Kenter, V Wan, CA Chan, R Clark, J Vit International Conference on Machine Learning, 3331-3340, 2019 | 103 | 2019 |
Google's Next-Generation Real-Time Unit-Selection Synthesizer Using Sequence-to-Sequence LSTM-Based Autoencoders. V Wan, Y Agiomyrgiannakis, H Silen, J Vit INTERSPEECH, 1143-1147, 2017 | 60 | 2017 |
Current state of text-to-speech system ARTIC: a decade of research on the field of speech technologies D Tihelka, Z Hanzlíček, M Jůzová, J Vít, J Matoušek, M Grůber Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno …, 2018 | 45 | 2018 |
Text-to-speech synthesis using an autoencoder BH Chun, J Gonzalvo, C Chan, I Agiomyrgiannakis, VPL Wan, RAJ Clark, ... US Patent 10,249,289, 2019 | 30 | 2019 |
Improving automatic dubbing with subtitle timing optimisation using video cut detection J Matoušek, J Vít 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 26 | 2012 |
On the analysis of training data for WaveNet-based speech synthesis J Vít, Z Hanzlíček, J Matoušek 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 16 | 2018 |
Unified Language-Independent DNN-Based G2P Converter. M Juzová, D Tihelka, J Vít Interspeech, 2085-2089, 2019 | 14 | 2019 |
Concatenation artifact detection trained from listeners evaluations J Vít, J Matoušek International Conference on Text, Speech and Dialogue, 169-176, 2013 | 11 | 2013 |
LSTM-based speech segmentation for TTS synthesis Z Hanzlíček, J Vít, D Tihelka International Conference on Text, Speech, and Dialogue, 361-372, 2019 | 9 | 2019 |
Czech speech synthesis with generative neural vocoder J Vít, Z Hanzlíček, J Matoušek Text, Speech, and Dialogue: 22nd International Conference, TSD 2019 …, 2019 | 8 | 2019 |
LSTM-based speech segmentation trained on different foreign languages Z Hanzlíček, J Vít International Conference on Text, Speech, and Dialogue, 456-464, 2020 | 7 | 2020 |
WaveNet-based speech synthesis applied to Czech: a comparison with the traditional synthesis methods Z Hanzlíček, J Vít, D Tihelka Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno …, 2018 | 7 | 2018 |
Using Auto-Encoder BiLSTM Neural Network for Czech Grapheme-to-Phoneme Conversion M Jůzová, J Vít Text, Speech, and Dialogue: 22nd International Conference, TSD 2019 …, 2019 | 4 | 2019 |
Grappling with web technologies: The problems of remote speech recording D Tihelka, M Jůzová, J Vít International Conference on Speech and Computer, 592-602, 2020 | 2 | 2020 |
KINterestTV-towards non–invasive measure of user interest while watching tv J Leroy, F Rocca, M Mancas, R Ben Madhkour, F Grisard, T Kliegr, ... Innovative and Creative Developments in Multimodal Interaction Systems: 9th …, 2014 | 2 | 2014 |
Automatická detekce a vizualizace chyb konkatenační syntézy řeči J Vít Západočeská univerzita v Plzni, 2013 | 2 | 2013 |
Save Your Voice: Voice Banking and TTS for Anyone D Tihelka, M Řezáčková, M Grůber, Z Hanzlíček, J Vít, J Matoušek International Speech Communication Association, 2021 | 1 | 2021 |
Speakers Talking Foreign Languages in a Multi-lingual TTS System Z Hanzlíček, J Vít, M Řezáčková Text, Speech, and Dialogue: 24th International Conference, TSD 2021, Olomouc …, 2021 | 1 | 2021 |
Unit-selection speech synthesis adjustments for audiobook-based voices J Vít, J Matoušek International Conference on Text, Speech, and Dialogue, 335-342, 2016 | 1 | 2016 |
Webový nástroj pro opravy anotací řečového inventáře J Vít Západočeská univerzita v Plzni, 2016 | 1 | 2016 |