Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1335 | 2023 |
To annotate or not? predicting performance drop under domain shift H Elsahar, M Gallé Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 117 | 2019 |
Monolingual adapters for zero-shot neural machine translation J Philip, A Berard, M Gallé, L Besacier The 2020 Conference on Empirical Methods in Natural Language Processing …, 2020 | 86 | 2020 |
Full and semi-batch clustering M Galle, JM Renders US Patent 8,880,525, 2014 | 74 | 2014 |
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ... arXiv preprint arXiv:2112.10508, 2021 | 65 | 2021 |
Self-supervised and controlled multi-document opinion summarization H Elsahar, M Coavoux, M Gallé, J Rozen arXiv preprint arXiv:2004.14754, 2020 | 49 | 2020 |
Multilingual unsupervised neural machine translation with denoising adapters A Üstün, A Berard, L Besacier, M Gallé arXiv preprint arXiv:2110.10472, 2021 | 40 | 2021 |
Unsupervised aspect-based multi-document abstractive summarization M Coavoux, H Elsahar, M Gallé Proceedings of the 2nd Workshop on New Frontiers in Summarization, 42-47, 2019 | 38 | 2019 |
System and method for resolving entity coreference M Gallé, JM Renders, G Jacquet US Patent 9,189,473, 2015 | 36 | 2015 |
Investigating the effectiveness of BPE: The power of shorter sequences M Gallé Proceedings of the 2019 conference on empirical methods in natural language …, 2019 | 35 | 2019 |
Chenglei Si, Wilson Y SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja Lee, Benoît Sagot, and Samson Tan, 2021 | 32 | 2021 |
Bloom: A 176b-parameter open-access multilingual language model. arXiv 2022 TL Scao, A Fan, C Akiki, E Pavlick, S Ilic, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 0 | 30 | |
On the Evaluation of Machine Translation for Terminology Consistency M Mahfuz ibn Alam, A Anastasopoulos, L Besacier, J Cross, M Gallé, ... arXiv e-prints, arXiv: 2106.11891, 2021 | 28* | 2021 |
Scalable spectral modeling of sparse sequence functions via a best matching algorithm AJ Quattoni, X Carreras, M Gallé US Patent App. 15/171,393, 2017 | 28 | 2017 |
Choosing word occurrences for the smallest grammar problem R Carrascosa, F Coste, M Gallé, G Infante-Lopez Language and Automata Theory and Applications, 154-165, 2010 | 28 | 2010 |
Chenglei Si, Wilson Y Lee, Benoît Sagot, et al. 2021. Between words and characters: A brief history of open-vocabulary modeling and tokenization in nlp SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja arXiv preprint arXiv:2112.10508 9, 2021 | 27 | 2021 |
Searching for smallest grammars on large sequences and application to DNA R Carrascosa, F Coste, M Gallé, G Infante-Lopez Journal of Discrete Algorithms 11, 62–72, 2011 | 24 | 2011 |
Searching for Compact Hierarchical Structures in DNA by means of the Smallest Grammar Problem M Gallé Université de Rennes 1, 2011 | 24 | 2011 |
A multilingual neural machine translation model for biomedical data A Bérard, ZM Kim, V Nikoulina, EL Park, M Gallé arXiv preprint arXiv:2008.02878, 2020 | 22 | 2020 |
Character-based NMT with transformer R Gupta, L Besacier, M Dymetman, M Gallé arXiv preprint arXiv:1911.04997, 2019 | 22 | 2019 |