As deep learning systems are scaled up to many billions of parameters, relating their internal structure to external behaviors becomes very challenging. Although daunting, this …
Z Hong, H Wu, S Dong, J Dong, Y Xiao… - arXiv preprint arXiv …, 2025 - arxiv.org
With the continuous advancement of large language models (LLMs) in mathematical reasoning, evaluating their performance in this domain has become a prominent research …