Characterizing deep learning training workloads on alibaba-pai

M Wang, C Meng, G Long, C Wu… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
Modern deep learning models have been exploited in various domains, including computer
vision (CV), natural language processing (NLP), search and recommendation. In practical AI …

Performance analysis and characterization of training deep learning models on mobile device

J Liu, J Liu, W Du, D Li - 2019 IEEE 25th International …, 2019 - ieeexplore.ieee.org
Training deep learning models on mobile devices recently becomes possible, because of
increasing computation power on mobile hardware and the advantages of enhancing user …

Performance analysis of deep learning workloads on leading-edge systems

Y Ren, S Yoo, A Hoisie - 2019 IEEE/ACM Performance …, 2019 - ieeexplore.ieee.org
This work examines the performance of leading-edge systems designed for machine
learning computing, including the NVIDIA DGX-2, Amazon Web Services (AWS) P3, IBM …

Model-driven cluster resource management for ai workloads in edge clouds

Q Liang, WA Hanafy, A Ali-Eldin, P Shenoy - ACM Transactions on …, 2023 - dl.acm.org
Since emerging edge applications such as Internet of Things (IoT) analytics and augmented
reality have tight latency constraints, hardware AI accelerators have been recently proposed …

Benchmarking the performance and energy efficiency of AI accelerators for AI training

Y Wang, Q Wang, S Shi, X He, Z Tang… - 2020 20th IEEE/ACM …, 2020 - ieeexplore.ieee.org
Deep learning has become widely used in complex AI applications. Yet, training a deep
neural network (DNNs) model requires a considerable amount of calculations, long running …

Fluid: Dataset abstraction and elastic acceleration for cloud-native deep learning training jobs

R Gu, K Zhang, Z Xu, Y Che, B Fan… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
Nowdays, it is prevalent to train deep learning (DL) models in cloud-native platforms that
actively leverage containerization and orchestration technologies for high elasticity, low and …

Deep learning at scale on nvidia v100 accelerators

R Xu, F Han, Q Ta - 2018 IEEE/ACM Performance Modeling …, 2018 - ieeexplore.ieee.org
The recent explosion in the popularity of Deep Learning (DL) is due to a combination of
improved algorithms, access to large datasets and increased computational power. This had …

Experimental characterizations and analysis of deep learning frameworks

Y Wu, W Cao, S Sahin, L Liu - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Big Data has fueled the wide deployment of Deep Learning (DL) in many fields, such as
image classification, voice recognition and NLP. The growing number of open source DL …

Scaling deep learning workloads: Nvidia dgx-1/pascal and intel knights landing

NA Gawande, JA Daily, C Siegel, NR Tallent… - Future Generation …, 2020 - Elsevier
Deep Learning (DL) algorithms have become ubiquitous in data analytics. As a result, major
computing vendors–including NVIDIA, Intel, AMD, and IBM–have architectural road maps …

Benchmarking TPU, GPU, and CPU platforms for deep learning

YE Wang, GY Wei, D Brooks - arXiv preprint arXiv:1907.10701, 2019 - arxiv.org
Training deep learning models is compute-intensive and there is an industry-wide trend
towards hardware specialization to improve performance. To systematically benchmark …