Towards Efficient I/O Pipelines using Accumulated Compression

A Maurya, B Nicolae, MM Rafique… - 2023 IEEE 30th …, 2023 - ieeexplore.ieee.org
High-Performance Computing (HPC) workloads generate large volumes of data at high-
frequency during their execution, which needs to be captured concurrently at scale. These …

Optimizing the Training of Co-Located Deep Learning Models Using Cache-Aware Staggering

K Assogba, B Nicolae… - 2023 IEEE 30th …, 2023 - ieeexplore.ieee.org
Despite significant advances, training deep learning models remains a time-consuming and
resource-intensive task. One of the key challenges in this context is the ingestion of the …

Understanding Patterns of Deep Learning Model Evolution in Network Architecture Search

R Underwood, M Madhyastha, R Burns… - 2023 IEEE 30th …, 2023 - ieeexplore.ieee.org
Network Architecture Search and specifically Regularized Evolution is a common way to
refine the structure of a deep learning model. However, little is known about how models …

EvoStore: Towards Scalable Storage of Evolving Learning Models

R Underwood, M Madhyastha, R Burns… - HPDC'24: 34nd …, 2024 - hal.science
Deep Learning (DL) has seen rapid adoption in all domains. Since training DL models is
expensive, both in terms of time and resources, application workflows that make use of DL …