Why globally re-shuffle? Revisiting data shuffling in large scale deep learning

TT Nguyen, F Trahay, J Domke, A Drozd… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Stochastic gradient descent (SGD) is the most prevalent algorithm for training Deep Neural
Networks (DNN). SGD iterates the input data set in each training epoch processing data …

Systematically inferring I/O performance variability by examining repetitive job behavior

E Costa, T Patel, B Schwaller, JM Brandt… - Proceedings of the …, 2021 - dl.acm.org
Monitoring and analyzing I/O behaviors is critical to the efficient utilization of parallel storage
systems. Unfortunately, with increasing I/O requirements and resource contention, I/O …

[PDF][PDF] Lessons From Examining Repetitive Job Behavior and I/O Performance Variability on a Production HPC System Emily Costa Northeastern University, USA …

E Costa, T Patel, B Schwaller, J Brandt, D Tiwari - 2021 - osti.gov
As I/O demand of scientific applications increases, identifying, predicting, and analyzing I/O
behaviors is critical to ensure parallel storage systems are efficiently utilized. This paper …

ASRDataset: A Multi-granularity Shuffle System for Preparing Large-scale ASR Training Data

F Jie, H Zhang, J Wang, Z Yu - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) is an essential task in the field of artificial intelligence.
With the widespread application of deep learning (DL), end-to-end ASR systems have …

[PDF][PDF] Deep Neural Networks for Large-Scale Cytoarchitectonic Mapping of the Human Brain

C Schiffer - 2022 - docserv.uni-duesseldorf.de
The analysis of microstructurally distinct cytoarchitectonic areas in the human brain provides
the foundation to associate functional, physiological, genetic, molecular, and connectivity …

[PDF][PDF] Automatic Analysis of Cortical Areas in Whole Brain Histological Sections using Convolutional Neural Networks

H Spitzer - 2020 - docserv.uni-duesseldorf.de
The segregation of the human brain in cytoarchitectonic areas is an important prerequisite
for the allocation of functional imaging, physiological, connectivity, molecular and genetic …

Contribution to automatic performance analysis of parallel applications

F Trahay - 2021 - hal.science
High Performance Computing is now a strategic resource as it allows to simulate complex
phenomena in order to better understand them. While ten years ago, HPC was mostly used …

Performance comparison for neuroscience application benchmarks

A Herten, T Hater, W Klijn, D Pleiter - … Germany, June 16-20, 2019, Revised …, 2019 - Springer
Abstract Researchers within the Human Brain Project and related projects have in the last
couple of years expanded their needs for high-performance computing infrastructures. The …

Comparing Data Staging Techniques for Large Scale Brain Images

L Oden - IEEE Transactions on Emerging Topics in Computing, 2020 - ieeexplore.ieee.org
The use of Deep Learning methods is identified as a key opportunity for enabling processing
of extreme-scale scientific datasets. Efficient processing of these datasets thus requires the …

Accelerating deep learning training on high-performance computing with storage tiering

MFL Dantas - 2022 - search.proquest.com
Deep Learning (DL) has become fundamental to the advancement of several areas, such as
computer vision, natural language processing and expert systems. Utilizing DL techniques …