Co-design for a64fx manycore processor and” fugaku”

M Sato, Y Ishikawa, H Tomita, Y Kodama… - … Conference for High …, 2020 - ieeexplore.ieee.org
We have been carrying out the FLAGSHIP 2020 Project to develop the Japanese next-
generation flagship supercomputer, the Post-K, recently named “Fugaku”. We have …

Human-scale brain simulation via supercomputer: a case study on the cerebellum

T Yamazaki, J Igarashi, H Yamaura - Neuroscience, 2021 - Elsevier
Performance of supercomputers has been steadily and exponentially increasing for the past
20 years, and is expected to increase further. This unprecedented computational power …

An in-depth analysis of the slingshot interconnect

D De Sensi, S Di Girolamo… - … Conference for High …, 2020 - ieeexplore.ieee.org
The interconnect is one of the most critical components in large scale computing systems,
and its impact on the performance of applications is going to increase with the system size …

[图书][B] Parallel programming

T Rauber, G Rünger - 2013 - Springer
Innovations in hardware architecture, such as hyper-threading or multicore processors,
make parallel computing resources available for computer systems in different areas …

Optical interconnects for high-performance computing

MA Taubenblatt - Journal of Lightwave Technology, 2011 - ieeexplore.ieee.org
High-performance computing systems are of steadily growing interest to provide new levels
of computational capability for an increasing range of applications. The growing use of and …

GENESIS 1.1: A hybrid‐parallel molecular dynamics simulator with enhanced sampling algorithms on multiple computational platforms

C Kobayashi, J Jung, Y Matsunaga, T Mori, T Ando… - 2017 - Wiley Online Library
GENeralized‐Ensemble SImulation System (GENESIS) is a software package for molecular
dynamics (MD) simulation of biological systems. It is designed to extend limitations in system …

Flare: Flexible in-network allreduce

D De Sensi, S Di Girolamo, S Ashkboos, S Li… - Proceedings of the …, 2021 - dl.acm.org
The allreduce operation is one of the most commonly used communication routines in
distributed applications. To improve its bandwidth and to reduce network traffic, this …

FFVHC-ACE: fully automated Cartesian-grid-based solver for compressible large-eddy simulation

H Asada, Y Tamaki, R Takaki, T Yumitori, S Tamura… - AIAA Journal, 2023 - arc.aiaa.org
This study presents a fully automated Cartesian-grid-based compressible flow solver,
named FrontFlow/Violet Hierarchical Cartesian for Aeronautics based on Compressible-flow …

A direct method for stereo correspondence based on singular value decomposition

M Pilu - Proceedings of IEEE Computer Society Conference on …, 1997 - ieeexplore.ieee.org
This paper proposes a new algorithm for matching point features across pairs of images.
Despite the well-known combinatorial complexity of the problem, this work shows that an …

Optical interconnects for extreme scale computing systems

S Rumley, M Bahadori, R Polster, SD Hammond… - Parallel Computing, 2017 - Elsevier
Large-scale high performance computing is permeating nearly every corner of modern
applications spanning from scientific research and business operations, to medical …