TransCODE: Co-design of transformers and accelerators for efficient training and inference- 学术资源搜索

TransCODE: Co-design of transformers and accelerators for efficient training and inference

S Tuli, NK Jha - IEEE Transactions on Computer-Aided Design …, 2023 - ieeexplore.ieee.org

IEEE Transactions on Computer-Aided Design of Integrated Circuits …, 2023•ieeexplore.ieee.org

Automated co-design of machine learning models and evaluation hardware is critical for efficiently deploying such models at scale. Despite the state-of-the-art performance of transformer models, they are not yet ready for execution on resource-constrained hardware platforms. High memory requirements and low parallelizability of the transformer architecture exacerbate this problem. Recently proposed accelerators attempt to optimize the throughput and energy consumption of transformer models. However, such works are either limited to a one-sided search of the model architecture or a restricted set of off-the-shelf devices. Furthermore, previous works only accelerate model inference and not training, which incurs substantially higher memory and compute resources, making the problem even more challenging. To address these limitations, this work proposes a dynamic training framework, called DynaProp, that speeds up the training process and reduces memory consumption. DynaProp is a low-overhead pruning method that prunes activations and gradients at runtime. To effectively execute this method on hardware for a diverse set of transformer architectures, we propose a flexible BERT accelerator, a framework that simulates transformer inference and training on a design space of accelerators. We use this simulator in conjunction with the proposed co-design technique, called TransCODE, to obtain the best-performing models with high accuracy on the given task and minimize latency, energy consumption, and chip area. The obtained transformer–accelerator pair achieves 0.3% higher accuracy than the state-of-the-art pair while incurring lower latency and lower energy consumption.

ieeexplore.ieee.org

展开收起

被引用次数：6 相关文章所有 5 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

TransCODE: Co-design of transformers and accelerators for efficient training and inference

引用