Path aggregation network for instance segmentation

S Liu, L Qi, H Qin, J Shi, J Jia - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
The way that information propagates in neural networks is of great importance. In this paper,
we propose Path Aggregation Network (PANet) aiming at boosting information flow in …

Scalable irregular parallelism with GPUs: Getting CPUs out of the way

Y Chen, B Brock, S Porumbescu… - … Conference for High …, 2022 - ieeexplore.ieee.org
We present Atos, a dynamic scheduling framework for multi-node-GPU systems that
supports PGAS-style lightweight one-sided memory operations within and between nodes …

RMA-MT: a benchmark suite for assessing MPI multi-threaded RMA performance

MGF Dosanjh, T Groves, RE Grant… - 2016 16th IEEE/ACM …, 2016 - ieeexplore.ieee.org
Reaching Exascale will require leveraging massive parallelism while potentially leveraging
asynchronous communication to help achieve scalability at such large levels of concurrency …

Portus: Efficient dnn checkpointing to persistent memory with zero-copy

Y Li, T Wu, G Li, Y Song, S Yin - 2024 IEEE 44th International …, 2024 - ieeexplore.ieee.org
We introduce Portus, an efficient checkpointing system for DNN models. The core of Portus
is a three-level index structure and a direct RDMA datapath that enables fast check-points …

Energy efficiency metrics

MK Patterson - Energy efficient thermal management of data centers, 2012 - Springer
In this chapter, metrics for measuring and improving data center efficiencies are explored.
Metrics at varying levels from the infrastructure components to the entire data center are …

LogSC: Model-based one-sided communication performance estimation

Z Wang, H Chen, X Dong, W Cai, X Zhang - Future Generation Computer …, 2022 - Elsevier
One-sided communication (also known as remote memory access, or RMA) in the Message
Passing Interface (MPI) is a communication interface that has been introduced in MPI-2 …

Light-weight protocols for wire-speed ordering

H Eberle, L Dennison - SC18: International Conference for …, 2018 - ieeexplore.ieee.org
We describe light-weight protocols for selective packet ordering in out-of-order networks that
carry memory traffic. The protocols are designed for heterogeneous high-performance …

Efficient notifications for MPI one-sided applications

M Sergent, CT Aitkaci, P Lemarinier… - Proceedings of the 26th …, 2019 - dl.acm.org
MPI One-sided communications have the potential to increase applications performance by
reducing the noise on remote processors. They consist in Remote Memory Accesses …

Optimizing NEURON brain simulator with remote memory access on distributed memory systems

D Shehzad, Z Bozkus - 2015 International Conference on …, 2015 - ieeexplore.ieee.org
The Complex neuronal network models require support from simulation environment for
efficient network simulations. To compute the models increasing complexity necessitated the …

Application level reordering of remote direct memory access operations

W Lavrijsen, C Iancu - 2017 IEEE International Parallel and …, 2017 - ieeexplore.ieee.org
We present methods for the effective application level reordering of non-blocking RDMA
operations. We supplement out-of-order hardware delivery mechanisms with heuristics to …