Gpt-neox-20b: An open-source autoregressive language model S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 2022 | 615 | 2022 |
Pythia: A suite for analyzing large language models across training and scaling S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ... International Conference on Machine Learning, 2397-2430, 2023 | 572 | 2023 |
Emergent and predictable memorization in large language models S Biderman, U Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ... Advances in Neural Information Processing Systems 36, 2024 | 83 | 2024 |
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon US Prashanth, A Deng, K O'Brien, J SV, MA Khan, J Borkar, ... arXiv preprint arXiv:2406.17746, 2024 | | 2024 |
Raithubot: An RLHF-Fine-Tuned Telugu Chatbot for Farmers J Srinivas, US Prashanth, A Malapaka, V Amulya International Conference on Data Science and Applications, 393-403, 2023 | | 2023 |