PaLM: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 3823 | 2022 |
BLOOM: A 176B-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1280 | 2023 |
PaLM 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 983 | 2023 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 817 | 2022 |
Bottom-up abstractive summarization S Gehrmann, Y Deng, AM Rush EMNLP 2018, 2018 | 810 | 2018 |
LSTMVis: A tool for visual analysis of hidden state dynamics in recurrent neural networks H Strobelt*, S Gehrmann*, H Pfister, AM Rush IEEE transactions on visualization and computer graphics 24 (1), 667-676, 2017 | 513 | 2017 |
BloombergGPT: A large language model for finance S Wu, O Irsoy, S Lu, V Dabravolski, M Dredze, S Gehrmann, P Kambadur, ... arXiv preprint arXiv:2303.17564, 2023 | 478 | 2023 |
GLTR: Statistical detection and visualization of generated text S Gehrmann*, H Strobelt*, AM Rush ACL Demo 2019, 2019 | 410 | 2019 |
Investigating gender bias in language models using causal mediation analysis J Vig*, S Gehrmann*, Y Belinkov*, S Qian, D Nevo, Y Singer, S Shieber NeurIPS 2020 33, 12388-12401, 2020 | 391* | 2020 |
Challenging BIG-Bench tasks and whether chain-of-thought can solve them M Suzgun, N Scales, N Schärli, S Gehrmann, Y Tay, HW Chung, ... ACL Findings 2023, 2022 | 347 | 2022 |
ToTTo: A controlled table-to-text generation dataset AP Parikh, X Wang, S Gehrmann, M Faruqui, B Dhingra, D Yang, D Das EMNLP 2020, 2020 | 305 | 2020 |
Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives S Gehrmann, F Dernoncourt, Y Li, ET Carlson, JT Wu, J Welt, J Foote Jr, ... PLoS ONE 13 (2), e0192360, 2018 | 271* | 2018 |
Seq2Seq-Vis: A visual debugging tool for sequence-to-sequence models H Strobelt*, S Gehrmann*, M Behrisch, A Perer, H Pfister, AM Rush IEEE transactions on visualization and computer graphics 25 (1), 353-363, 2018 | 250 | 2018 |
Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations P Das, T Sercu, K Wadhawan, I Padhi, S Gehrmann, F Cipcigan, ... Nature Biomedical Engineering 5 (6), 613-623, 2021 | 249 | 2021 |
The language interpretability tool: Extensible, interactive visualizations and analysis for NLP models I Tenney, J Wexler, J Bastings, T Bolukbasi, A Coenen, S Gehrmann, ... ACL Demo 2020, 2020 | 178 | 2020 |
exBERT: A visual analysis tool to explore learned representations in transformers models B Hoover, H Strobelt, S Gehrmann EMNLP Demo 2019, 2019 | 174 | 2019 |
The GEM benchmark: Natural language generation, its evaluation and metrics S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ... GEM Workshop at ACL 2021, 2021 | 128 | 2021 |
Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text S Gehrmann, E Clark, T Sellam JAIR, 2022 | 116 | 2022 |
End-to-end content and plan selection for data-to-text generation S Gehrmann, FZ Dai, H Elder, AM Rush INLG 2018, 2018 | 85 | 2018 |
PaLM: Scaling language modeling with pathways. arXiv 2022 A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311 10, 2022 | 84 | 2022 |