Universal neural vocoding with parallel wavenet Y Jiao, A Gabryś, G Tinchev, B Putrycz, D Korzekwa, V Klimkov ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 56 | 2021 |
Interpretable deep learning model for the detection and reconstruction of dysarthric speech D Korzekwa, R Barra-Chicote, B Kostek, T Drugman, M Lajszczak Interspeech 2019, 2019 | 36 | 2019 |
Computer-assisted pronunciation training—Speech synthesis is almost all you need D Korzekwa, J Lorenzo-Trueba, T Drugman, B Kostek Speech Communication 142, 22-33, 2022 | 30 | 2022 |
Non-autoregressive TTS with explicit duration modelling for low-resource highly expressive speech R Shah, K Pokora, A Ezzerg, V Klimkov, G Huybrechts, B Putrycz, ... arXiv preprint arXiv:2106.12896, 2021 | 26 | 2021 |
Comprehensive evaluation of statistical speech waveform synthesis T Merritt, B Putrycz, A Nadolski, T Ye, D Korzekwa, W Dolecki, T Drugman, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 325-331, 2018 | 21 | 2018 |
Creating New Voices using Normalizing Flows P Bilinski, T Merritt, A Ezzerg, K Pokora, S Cygert, K Yanagisawa, ... Interspeech 2022, 2022 | 19 | 2022 |
Weakly-supervised word-level pronunciation error detection in non-native English speech D Korzekwa, J Lorenzo-Trueba, T Drugman, S Calamaro, B Kostek arXiv preprint arXiv:2106.03494, 2021 | 19 | 2021 |
Text-free non-parallel many-to-many voice conversion using normalising flow T Merritt, A Ezzerg, P Biliński, M Proszewska, K Pokora, R Barra-Chicote, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 16 | 2022 |
Mispronunciation detection in non-native (L2) English with uncertainty modeling D Korzekwa, J Lorenzo-Trueba, S Zaporowski, S Calamaro, T Drugman, ... ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021 | 16 | 2021 |
Detection of lexical stress errors in non-native (l2) english with data augmentation and attention D Korzekwa, R Barra-Chicote, S Zaporowski, G Beringer, ... arXiv preprint arXiv:2012.14788, 2020 | 13 | 2020 |
Varying speaking styles with neural textto-speech T Wood, T Merritt Alexa Blogs, Nov 19, 2018 | 12 | 2018 |
Enhancing audio quality for expressive neural text-to-speech A Ezzerg, A Gabrys, B Putrycz, D Korzekwa, D Saez-Trigueros, ... arXiv preprint arXiv:2108.06270, 2021 | 9 | 2021 |
L2-GEN: A Neural Phoneme Paraphrasing Approach to L2 Speech Synthesis for Mispronunciation Diagnosis DY Zhang, A Ganesan, S Campbell, D Korzekwa Interspeech 2022, 2022 | 7 | 2022 |
Text-to-speech (TTS) processing AF Nadolski, D Korzekwa, TE Merritt, M Nicolis, B Putrycz, RB Chicote, ... US Patent 10,699,695, 2020 | 5 | 2020 |
Remap, warp and attend: Non-parallel many-to-many accent conversion with normalizing flows A Ezzerg, T Merritt, K Yanagisawa, P Bilinski, M Proszewska, K Pokora, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 984-990, 2023 | 4 | 2023 |
Constructing a dataset of speech recordings with lombard effect D Weber, S Zaporowski, D Korzekwa 2020 Signal Processing: Algorithms, Architectures, Arrangements, and …, 2020 | 4 | 2020 |
Deep learning model for automated assessment of lexical stress of non-native English speakers D Korzekwa, B Kostek The Journal of the Acoustical Society of America 146 (4_Supplement), 2956-2957, 2019 | 3 | 2019 |
AE-Flow: AutoEncoder Normalizing Flow J Mosiński, P Biliński, T Merritt, A Ezzerg, D Korzekwa ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
On granularity of prosodic representations in expressive text-to-speech M Babiański, K Pokora, R Shah, R Sienkiewicz, D Korzekwa, V Klimkov 2022 IEEE Spoken Language Technology Workshop (SLT), 892-899, 2023 | 2 | 2023 |
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech G Zhang, T Merritt, MS Ribeiro, B Tura-Vecino, K Yanagisawa, K Pokora, ... arXiv preprint arXiv:2307.16679, 2023 | 1 | 2023 |