Status, challenges and trends of data-intensive supercomputing

J Wei, M Chen, L Wang, P Ren, Y Lei, Y Qu… - CCF Transactions on …, 2022 - Springer
Supercomputing technology has been supporting the solution of cutting-edge scientific and
complex engineering problems since its inception—serving as a comprehensive …

Angara interconnect makes GPU-based Desmos supercomputer an efficient tool for molecular dynamics calculations

V Stegailov, E Dlinnova, T Ismagilov… - … Journal of High …, 2019 - journals.sagepub.com
In this article, we describe the Desmos supercomputer that consists of 32 hybrid nodes
connected by a low-latency high-bandwidth Angara interconnect with torus topology. This …

End-to-end I/O monitoring on leading supercomputers

B Yang, W Xue, T Zhang, S Liu, X Ma, X Wang… - ACM Transactions on …, 2023 - dl.acm.org
This paper offers a solution to overcome the complexities of production system I/O
performance monitoring. We present Beacon, an end-to-end I/O resource monitoring and …

Shentu: processing multi-trillion edge graphs on millions of cores in seconds

H Lin, X Zhu, B Yu, X Tang, W Xue… - … Conference for High …, 2018 - ieeexplore.ieee.org
Graphs are an important abstraction used in many scientific fields. With the magnitude of
graph-structured data constantly increasing, effective data analytics requires efficient and …

swSpTRSV: A fast sparse triangular solve with sparse level tile layout on sunway architectures

X Wang, W Liu, W Xue, L Wu - Proceedings of the 23rd ACM SIGPLAN …, 2018 - dl.acm.org
Sparse triangular solve (SpTRSV) is one of the most important kernels in many real-world
applications. Currently, much research on parallel SpTRSV focuses on level-set construction …

Towards efficient spmv on sunway manycore architectures

C Liu, B Xie, X Liu, W Xue, H Yang, X Liu - Proceedings of the 2018 …, 2018 - dl.acm.org
Sparse Matrix-Vector Multiplication (SpMV) is an essential computation kernel for many data-
analytic workloads running in both supercomputers and data centers. The intrinsic …

Scaling graph traversal to 281 trillion edges with 40 million cores

H Cao, Y Wang, H Wang, H Lin, Z Ma, W Yin… - Proceedings of the 27th …, 2022 - dl.acm.org
Graph processing, especially high-performance graph traversal, plays a more and more
important role in data analytics. The successor of Sunway TaihuLight, New Sunway, is …

Parallelization and optimization of NSGA-II on sunway TaihuLight system

X Liu, J Sun, L Zheng, S Wang, Y Liu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Sunway TaihuLight system is the first supercomputer offering a peak performance over 100
PFlops, which can be utilized to parallelize Non-dominated Sorting Genetic Algorithm II …

Scaling graph 500 sssp to 140 trillion edges with over 40 million cores

Y Wang, H Cao, Z Ma, W Yin… - … Conference for High …, 2022 - ieeexplore.ieee.org
The SSSP kernel was first introduced into the Graph 500 benchmark in 2017. However,
there has been no result from a full-scale world-top supercomputer. The primary reason is …

Tianhegraph: Customizing graph search for graph500 on tianhe supercomputer

X Gan, Y Zhang, R Wang, T Li, T Xiao… - … on Parallel and …, 2021 - ieeexplore.ieee.org
As the era of exascale supercomputing is coming, it is vital for next-generation
supercomputers to find appropriate applications with high social and economic benefit. In …