Q Lv, J Sun, S Zhou, X Zhang, L Li, Y Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
To reduce computational overhead while maintaining model performance, model pruning
techniques have been proposed. Among these, structured pruning, which removes entire …