Autoablation: Automated parallel ablation studies for deep learning

S Sheikholeslami, M Meister, T Wang… - Proceedings of the 1st …, 2021 - dl.acm.org
Ablation studies provide insights into the relative contribution of different architectural and
regularization components to machine learning models' performance. In this paper, we …

ExtremeEarth meets satellite data from space

DH Hagos, T Kakantousis, V Vlassov… - IEEE Journal of …, 2021 - ieeexplore.ieee.org
Bringing together a number of cutting-edge technologies that range from storing extremely
large volumes of data all the way to developing scalable machine learning and deep …

Towards Trustworthy Machine Learning in Production: An Overview of the Robustness in MLOps Approach

F Bayram, BS Ahmed - arXiv preprint arXiv:2410.21346, 2024 - arxiv.org
Artificial intelligence (AI), and especially its sub-field of Machine Learning (ML), are
impacting the daily lives of everyone with their ubiquitous applications. In recent years, AI …

AI-coupled HPC Workflow Applications, Middleware and Performance

W Brewer, A Gainaru, F Suter, F Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
AI integration is revolutionizing the landscape of HPC simulations, enhancing the
importance, use, and performance of AI-driven HPC workflows. This paper surveys the …

Tuning parameters of Apache Spark with Gauss–Pareto-based multi-objective optimization

MM Öztürk - Knowledge and Information Systems, 2024 - Springer
When there is a need to make an ultimate decision about the unique features of big data
platforms, one should note that they have configurable parameters. Apache Spark is an …

ABLATOR: Robust Horizontal-Scaling of Machine Learning Ablation Experiments

I Fostiropoulos, L Itti - International Conference on …, 2023 - proceedings.mlr.press
Understanding the efficacy of a method requires ablation experiments. Current Machine
Learning (ML) workflows emphasize the vertical scaling of large models with paradigms …

Jespipe: a plugin-based, open MPI framework for adversarial machine learning analysis

S Alemany, J Nucciarone… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Research is increasingly showing the tremendous vulnerability in machine learning models
to seemingly undetectable adversarial inputs. One of the current limitations in adversarial …

Distributed training and scalability for the particle clustering method UCluster

OS Gudnadottir, D Gedon, C Desmarais… - EPJ Web of …, 2021 - epj-conferences.org
In recent years, machine-learning methods have become increasingly important for the
experiments at the Large Hadron Collider (LHC). They are utilised in everything from trigger …

Accelerate model parallel training by using efficient graph traversal order in device placement

T Wang, AH Payberah, DH Hagos… - arXiv preprint arXiv …, 2022 - arxiv.org
Modern neural networks require long training to reach decent performance on massive
datasets. One common approach to speed up training is model parallelization, where large …

MFRLMO: Model-free reinforcement learning for multi-objective optimization of apache spark

MM Öztürk - EAI Endorsed Transactions on Scalable Information …, 2024 - papers.ssrn.com
Hyperparameter optimization (HO) is a must to figure out to what extent can a specific
configuration of hyperparameters contribute to the performance of a machine learning task …