B Geshkovski, C Letrouit, Y Polyanskiy… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers play a central role in the inner workings of large language models. We
develop a mathematical framework for analyzing Transformers based on their interpretation …