Sustainable ai: Environmental implications, challenges and opportunities

CJ Wu, R Raghavendra, U Gupta… - Proceedings of …, 2022 - proceedings.mlsys.org
This paper explores the environmental impact of the super-linear growth trends for AI from a
holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the …

Tpugraphs: A performance prediction dataset on large tensor computational graphs

M Phothilimthana, S Abu-El-Haija… - Advances in …, 2024 - proceedings.neurips.cc
Precise hardware performance models play a crucial role in code optimizations. They can
assist compilers in making heuristic decisions or aid autotuners in identifying the optimal …

Xrbench: An extended reality (xr) machine learning benchmark suite for the metaverse

H Kwon, K Nair, J Seo, J Yik… - Proceedings of …, 2023 - proceedings.mlsys.org
Real-time multi-task multi-model (MTMM) workloads, a new form of deep learning inference
workloads, are emerging for applications areas like extended reality (XR) to support …

Graft: Efficient inference serving for hybrid deep learning with SLO guarantees via DNN re-alignment

J Wu, L Wang, Q Jin, F Liu - IEEE Transactions on Parallel and …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) have been widely adopted for various mobile inference tasks,
yet their ever-increasing computational demands are hindering their deployment on …

Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi-DNN Workloads

H Fan, SI Venieris, A Kouris, N Lane - … of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Running multiple deep neural networks (DNNs) in parallel has become an emerging
workload in both edge devices, such as mobile phones where multiple tasks serve a single …

Hypervolume knowledge gradient: a lookahead approach for multi-objective bayesian optimization with partial information

S Daulton, M Balandat… - … Conference on Machine …, 2023 - proceedings.mlr.press
Bayesian optimization is a popular method for sample efficient multi-objective optimization.
However, existing Bayesian optimization techniques fail to effectively exploit common and …

DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

Z Zhang, Y Zhao, H Li, C Lin… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to limited resources on edge and different characteristics of deep neural network (DNN)
models, it is a big challenge to optimize DNN inference performance in terms of energy …

Unveiling energy efficiency in deep learning: Measurement, prediction, and scoring across edge devices

X Tu, A Mallik, D Chen, K Han, O Altintas… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Today, deep learning optimization is primarily driven by research focused on achieving high
inference accuracy and reducing latency. However, the energy efficiency aspect is often …

FastML Science Benchmarks: Accelerating Real-Time Scientific Edge Machine Learning

J Duarte, N Tran, B Hawks, C Herwig, J Muhizi… - arXiv preprint arXiv …, 2022 - arxiv.org
Applications of machine learning (ML) are growing by the day for many unique and
challenging scientific applications. However, a crucial challenge facing these applications is …

ML-EXray: Visibility into ML deployment on the edge

H Qiu, I Vavelidou, J Li, E Pergament… - Proceedings of …, 2022 - proceedings.mlsys.org
Benefited from expanding cloud infrastructure, today's neural networks have increasingly
high performance trained on the cloud. Model researchers spent months of sweat competing …