Recurrent memory transformer A Bulatov, Y Kuratov, M Burtsev Advances in Neural Information Processing Systems 35, 11079-11091, 2022 | 108 | 2022 |
Scaling transformer to 1m tokens and beyond with rmt A Bulatov, Y Kuratov, Y Kapushev, MS Burtsev arXiv preprint arXiv:2304.11062, 2023 | 63 | 2023 |
In search of needles in a 10m haystack: Recurrent memory finds what llms miss Y Kuratov, A Bulatov, P Anokhin, D Sorokin, A Sorokin, M Burtsev arXiv preprint arXiv:2402.10790, 2024 | 14 | 2024 |
Beyond attention: Breaking the limits of transformer context length with recurrent memory A Bulatov, Y Kuratov, Y Kapushev, M Burtsev Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17700 …, 2024 | 5 | 2024 |
Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information A Chepurova, A Bulatov, Y Kuratov, M Burtsev arXiv preprint arXiv:2311.01326, 2023 | 3 | 2023 |
Prompt Me One More Time: A Two-Step Knowledge Extraction Pipeline with Ontology-Based Verification A Chepurova, Y Kuratov, A Bulatov, M Burtsev Proceedings of TextGraphs-17: Graph-based Methods for Natural Language …, 2024 | 1 | 2024 |
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Y Kuratov, A Bulatov, P Anokhin, I Rodkin, D Sorokin, A Sorokin, ... arXiv preprint arXiv:2406.10149, 2024 | 1 | 2024 |
Long Input Benchmark for Russian Analysis I Churin, M Apishev, M Tikhonova, D Shevelev, A Bulatov, Y Kuratov, ... arXiv preprint arXiv:2408.02439, 2024 | | 2024 |
Associative Recurrent Memory Transformer I Rodkin, Y Kuratov, A Bulatov, M Burtsev arXiv preprint arXiv:2407.04841, 2024 | | 2024 |