Hard-coded gaussian attention for neural machine translation W You, S Sun, M Iyyer ACL 2020, 2020 | 67 | 2020 |
Do Long-Range Language Models Actually Use Long-Range Context? S Sun, K Krishna, A Mattarella-Micke, M Iyyer EMNLP 2021, 2021 | 65 | 2021 |
How to compare summarizers without target length? pitfalls, solutions and re-examination of the neural summarization literature S Sun, O Shapira, I Dagan, A Nenkova Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural …, 2019 | 54 | 2019 |
Energy-based reranking: Improving neural machine translation using energy-based models S Bhattacharyya, A Rooshenas, S Naskar, S Sun, M Iyyer, A McCallum ACL 2021, 2020 | 42 | 2020 |
Pearl: Prompting large language models to plan and execute actions over long documents S Sun, Y Liu, S Wang, C Zhu, M Iyyer arXiv preprint arXiv:2305.14564, 2023 | 37 | 2023 |
RULER: What's the Real Context Size of Your Long-Context Language Models? CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, B Ginsburg arXiv preprint arXiv:2404.06654, 2024 | 26 | 2024 |
The feasibility of embedding based automatic evaluation for single document summarization S Sun, A Nenkova Proceedings of the 2019 conference on empirical methods in natural language …, 2019 | 23 | 2019 |
TopicGPT: A prompt-based topic modeling framework CM Pham, A Hoyle, S Sun, M Iyyer arXiv preprint arXiv:2311.01449, 2023 | 22 | 2023 |
Revisiting simple neural probabilistic language models S Sun, M Iyyer NAACL 2021, 2021 | 15 | 2021 |
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of rlhf S Sun, D Gupta, M Iyyer arXiv preprint arXiv:2309.09055, 2023 | 13 | 2023 |
How does in-context learning help prompt tuning? S Sun, Y Liu, D Iter, C Zhu, M Iyyer arXiv preprint arXiv:2302.11521, 2023 | 13 | 2023 |
Alternative Input Signals Ease Transfer in Multilingual Machine Translation S Sun, A Fan, J Cross, V Chaudhary, C Tran, P Koehn, F Guzmán ACL 2022, 2022 | 12 | 2022 |
IGA: An intent-guided authoring assistant S Sun, W Zhao, V Manjunatha, R Jain, V Morariu, F Dernoncourt, ... EMNLP 2021, 2021 | 12 | 2021 |
ChapterBreak: A Challenge Dataset for Long-Range Language Models S Sun, K Thai, M Iyyer NAACL 2022, 2022 | 11 | 2022 |
Energy-based reranking: Improving neural machine translation using energy-based models S Naskar, A Rooshenas, S Sun, M Iyyer, A McCallum arXiv e-prints, arXiv: 2009.13267, 2020 | 10 | 2020 |
Name disambiguation for chinese scientific authors with multi-level clustering S Sun, H Zhang, N Li, Y Chen 2017 IEEE International Conference on Computational Science and Engineering …, 2017 | 7 | 2017 |
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages S Sun, M Elbayad, A Sun, J Cross EACL 2023, 2023 | 2 | 2023 |
Suri: Multi-constraint Instruction Following for Long-form Text Generation CM Pham, S Sun, M Iyyer arXiv preprint arXiv:2406.19371, 2024 | 1 | 2024 |
TOWARDS EFFECTIVE MODELING OF LONG-RANGE CONTEXT S SUN University of Massachusetts Amherst, 2024 | | 2024 |
How Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge? S Sun, BW Dillon, M Iyyer Proceedings of the Third Workshop on Insights from Negative Results in NLP …, 2022 | | 2022 |