RARR: Researching and revising what language models say, using language models | L Gao, Z Dai, P Pasupat, A Chen, AT Chaganty, Y Fan, V Zhao, N Lao, ... | Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 152* | 2023 |
Entity-based knowledge conflicts in question answering | S Longpre, K Perisetla, A Chen, N Ramesh, C DuBois, S Singh | arXiv preprint arXiv:2109.05052, 2021 | 141 | 2021 |
Evaluating Question Answering Evaluation | A Chen, G Stanovsky, S Singh, M Gardner | Proceedings of the 2nd Workshop on Machine Reading for Question Answering …, 2019 | 83 | 2019 |
MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics | A Chen, G Stanovsky, S Singh, M Gardner | arXiv preprint arXiv:2010.03636, 2020 | 41 | 2020 |
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP | A Chen, P Gudipati, S Longpre, X Ling, S Singh | arXiv preprint arXiv:2106.06830, 2021 | 39 | 2021 |
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI | S Longpre, R Mahari, A Chen, N Obeng-Marnu, D Sileo, W Brannon, ... | arXiv preprint arXiv:2310.16787, 2023 | 30* | 2023 |
PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions | A Chen, P Pasupat, S Singh, H Lee, K Guu | arXiv preprint arXiv:2305.14908, 2023 | 27 | 2023 |
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More? | J Lee, A Chen, Z Dai, D Dua, DS Sachan, M Boratko, Y Luan, SMR Arnold, ... | arXiv preprint arXiv:2406.13121, 2024 | | 2024 |