The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers Z Li, C You, S Bhojanapalli, D Li, AS Rawat, SJ Reddi, K Ye, F Chern, ... The Eleventh International Conference on Learning Representations, 2022 | 60* | 2022 |
Decoupled context processing for context augmented language modeling Z Li, R Guo, S Kumar Advances in Neural Information Processing Systems 35, 21698-21710, 2022 | 16 | 2022 |
ReST meets ReAct: Self-improvement for multi-step reasoning LLM agent R Aksitov, S Miryoosefi, Z Li, D Li, S Babayan, K Kopparapu, Z Fisher, ... arXiv preprint arXiv:2312.10003, 2023 | 12 | 2023 |
ResMem: Learn what you can and memorize the rest Z Yang, M Lukasik, V Nagarajan, Z Li, A Rawat, M Zaheer, AK Menon, ... Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |