From characters to words: the turning point of BPE merges X Gutierrez-Vasques, C Bentz, O Sozinova, T Samardzic Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 23 | 2021 |
Complexity trade-offs and equi-complexity in natural languages: a meta-analysis C Bentz, X Gutierrez-Vasques, O Sozinova, T Samardžić Linguistics Vanguard 9 (s1), 9-25, 2023 | 20 | 2023 |
TeDDi Sample: Text Data Diversity Sample for Language Comparison and Multilingual NLP S Moran, C Bentz, X Gutierrez-Vasques, O Sozinova, T Samardzic Proceedings of the Thirteenth Language Resources and Evaluation Conference …, 2022 | 7 | 2022 |
Interpretability for morphological inflection: from character-level predictions to subword-level rules T Ruzsics, O Sozinova, X Gutierrez-Vasques, T Samardzic Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 5 | 2021 |
Subword Evenness (SuE) as a Predictor of Cross-lingual Transfer to Low-resource Languages O Pelloni, A Shaitarova, T Samardzic Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 4 | 2022 |
Complex Networks-Based Approach to Russian Rhyme History Description: Linguostatistics and Database. O Sozinova DH, 891-893, 2016 | 3 | 2016 |
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers T Samardzic, X Gutierrez-Vasques, R van der Goot, M Müller-Eberstein, ... Proceedings of the 26th Conference on Computational Natural Language …, 2022 | 2 | 2022 |
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets T Samardzic, X Gutierrez, C Bentz, S Moran, O Pelloni arXiv preprint arXiv:2403.03909, 2024 | | 2024 |
Geometric Patterns in Text and Multilingual NLP O Pelloni University of Zurich, 2023 | | 2023 |
A multimedia corpus of the Yiddish language TA Arkhangel’skii, OA Sozinova Automatic Documentation and Mathematical Linguistics 49, 47-53, 2015 | | 2015 |
Subword Geometry: Picturing Word Shapes O Sozinova, T Samardžić | | |