When cloud storage meets {RDMA}

Y Gao, Q Li, L Tang, Y Xi, P Zhang, W Peng… - … USENIX Symposium on …, 2021 - usenix.org
A production-level cloud storage system must be high performing and readily available. It
should also meet a Service-Level Agreement (SLA). The rapid advancement in storage …

The common communication interface (CCI)

S Atchley, D Dillow, G Shipman… - 2011 IEEE 19th …, 2011 - ieeexplore.ieee.org
There are many APIs for connecting and exchanging data between network peers. Each
interface varies wildly based on metrics including performance, portability, and complexity …

Optimizing blocking and nonblocking reduction operations for multicore systems: Hierarchical design and implementation

MG Venkata, P Shamis, R Sampath… - 2013 IEEE …, 2013 - ieeexplore.ieee.org
Many scientific simulations, using the Message Passing Interface (MPI) programming model,
are sensitive to the performance and scalability of reduction collective operations such as …

[PDF][PDF] Hardware-effiziente, hochparallele Implementierungen von Lattice-Boltzmann-Verfahren für komplexe Geometrien

M Wittmann - 2016 - opus4.kobv.de
Die Lattice-Boltzmann-Methoden haben sich als wichtige Verfahren zur numerischen
Strömungssimulation etabliert. Insbesondere für parallele Simulationen von Strömungen in …

MPI and UPC broadcast, scatter and gather algorithms in Xeon Phi

DA Mallón, GL Taboada… - … and Computation: Practice …, 2016 - Wiley Online Library
Accelerators have revolutionised the high performance computing (HPC) community.
Despite their advantages, their very specific programming models and limited …

Scalable PGAS collective operations in NUMA clusters

DA Mallón, GL Taboada, C Teijeiro… - Cluster computing, 2014 - Springer
The increasing number of cores per processor is turning manycore-based systems in
pervasive. This involves dealing with multiple levels of memory in non uniform memory …

High-Performance Multi-Transport MPI Design for Ultra-Scale InfiniBand Clusters

MJ Koop - 2009 - rave.ohiolink.edu
Over the past decade, rapid advances have taken place in the field of computer and network
design enabling us to connect thousands of computers together to form high performance …

MovementFinder: Visual analytics of origin-destination patterns from geo-tagged social media

S Chen, C Guo, X Yuan, J Zhang… - 2014 IEEE Conference …, 2014 - ieeexplore.ieee.org
Geo-tagged social media data can be viewed as sampling of people's trajectories in daily
life. It consists of people's movements and embeds the semantics of movements. However, it …

Virtualizing modern high-speed interconnection networks with performance and scalability

B Li, Z Huo, P Zhang, D Meng - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
As one of the most important enabling technologies of cloud computing, virtualization brings
to HPC good manageability, online system maintenance, performance isolation and fault …

[PDF][PDF] Design of scalable PGAS collectives for NUMA and manycore systems

DA Mallon - PhD thesis, 2014 - gac.des.udc.es
The increasing number of cores per processor is turning multicore-based systems in
pervasive. This involves dealing with multiple levels of memory in NUMA systems …