LayerCollapse: Adaptive compression of neural networks

SZ Shabgahi, MS Shariff, F Koushanfar - arXiv preprint arXiv:2311.17943, 2023
Handling the ever-increasing scale of contemporary deep learning and transformer-based
models poses a significant challenge. Overparameterized Transformer networks outperform …