关注
Simeng Sun
Simeng Sun
在 nvidia.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Hard-coded gaussian attention for neural machine translation
W You, S Sun, M Iyyer
ACL 2020, 2020
672020
Do Long-Range Language Models Actually Use Long-Range Context?
S Sun, K Krishna, A Mattarella-Micke, M Iyyer
EMNLP 2021, 2021
652021
How to compare summarizers without target length? pitfalls, solutions and re-examination of the neural summarization literature
S Sun, O Shapira, I Dagan, A Nenkova
Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural …, 2019
542019
Energy-based reranking: Improving neural machine translation using energy-based models
S Bhattacharyya, A Rooshenas, S Naskar, S Sun, M Iyyer, A McCallum
ACL 2021, 2020
422020
Pearl: Prompting large language models to plan and execute actions over long documents
S Sun, Y Liu, S Wang, C Zhu, M Iyyer
arXiv preprint arXiv:2305.14564, 2023
372023
RULER: What's the Real Context Size of Your Long-Context Language Models?
CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, B Ginsburg
arXiv preprint arXiv:2404.06654, 2024
262024
The feasibility of embedding based automatic evaluation for single document summarization
S Sun, A Nenkova
Proceedings of the 2019 conference on empirical methods in natural language …, 2019
232019
TopicGPT: A prompt-based topic modeling framework
CM Pham, A Hoyle, S Sun, M Iyyer
arXiv preprint arXiv:2311.01449, 2023
222023
Revisiting simple neural probabilistic language models
S Sun, M Iyyer
NAACL 2021, 2021
152021
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of rlhf
S Sun, D Gupta, M Iyyer
arXiv preprint arXiv:2309.09055, 2023
132023
How does in-context learning help prompt tuning?
S Sun, Y Liu, D Iter, C Zhu, M Iyyer
arXiv preprint arXiv:2302.11521, 2023
132023
Alternative Input Signals Ease Transfer in Multilingual Machine Translation
S Sun, A Fan, J Cross, V Chaudhary, C Tran, P Koehn, F Guzmán
ACL 2022, 2022
122022
IGA: An intent-guided authoring assistant
S Sun, W Zhao, V Manjunatha, R Jain, V Morariu, F Dernoncourt, ...
EMNLP 2021, 2021
122021
ChapterBreak: A Challenge Dataset for Long-Range Language Models
S Sun, K Thai, M Iyyer
NAACL 2022, 2022
112022
Energy-based reranking: Improving neural machine translation using energy-based models
S Naskar, A Rooshenas, S Sun, M Iyyer, A McCallum
arXiv e-prints, arXiv: 2009.13267, 2020
102020
Name disambiguation for chinese scientific authors with multi-level clustering
S Sun, H Zhang, N Li, Y Chen
2017 IEEE International Conference on Computational Science and Engineering …, 2017
72017
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
S Sun, M Elbayad, A Sun, J Cross
EACL 2023, 2023
22023
Suri: Multi-constraint Instruction Following for Long-form Text Generation
CM Pham, S Sun, M Iyyer
arXiv preprint arXiv:2406.19371, 2024
12024
TOWARDS EFFECTIVE MODELING OF LONG-RANGE CONTEXT
S SUN
University of Massachusetts Amherst, 2024
2024
How Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge?
S Sun, BW Dillon, M Iyyer
Proceedings of the Third Workshop on Insights from Negative Results in NLP …, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–20