作者
Md Al Maruf, Akramul Azim, Nitin Auluck, Mansi Sahi
发表日期
2024/8/1
期刊
Journal of Parallel and Distributed Computing
卷号
190
页码范围
104890
出版商
Academic Press
简介
Deep Neural Networks (DNNs) have gained widespread popularity in different domain applications due to their dominant performance. Despite the prevalence of massively parallel multi-core processor architectures, adopting large DNN models in embedded systems remains challenging, as most embedded applications are designed with single-core processors in mind. This limits DNN adoption in embedded systems due to inefficient leveraging of model parallelization and workload partitioning. Prior solutions attempt to address these challenges using data and model parallelism. However, they lack in finding optimal DNN model partitions and distributing them efficiently to achieve improved performance.
This paper proposes a DNN model parallelism framework to accelerate model training by finding the optimal number of model partitions and resource provisions. The proposed framework combines data and …
学术搜索中的文章
M Al Maruf, A Azim, N Auluck, M Sahi - Journal of Parallel and Distributed Computing, 2024