swSpTRSV: A fast sparse triangular solve with sparse level tile layout on sunway architectures

X Wang, W Liu, W Xue, L Wu - Proceedings of the 23rd ACM SIGPLAN …, 2018 - dl.acm.org
Sparse triangular solve (SpTRSV) is one of the most important kernels in many real-world
applications. Currently, much research on parallel SpTRSV focuses on level-set construction …

Retrofitting high availability mechanism to tame hybrid transaction/analytical processing

S Shen, R Chen, H Chen, B Zang - 15th {USENIX} Symposium on …, 2021 - usenix.org
Many application domains can benefit from hybrid transaction/analytical processing (HTAP)
by executing queries on real-time datasets produced by concurrent transactions. However …

Efficient GPU-based electromagnetic transient simulation for power systems with thread-oriented transformation and automatic code generation

Y Song, Y Chen, S Huang, Y Xu, Z Yu, W Xue - IEEE Access, 2018 - ieeexplore.ieee.org
Electromagnetic transients (EMT) simulation is the most accurate and intensive computation
for power systems. Past research has shown the potential of accelerating such simulations …

Harmonia: a high throughput B+ tree for GPUs

Z Yan, Y Lin, L Peng, W Zhang - Proceedings of the 24th symposium on …, 2019 - dl.acm.org
B+ tree is one of the most important data structures and has been widely used in different
fields. With the increase of concurrent queries and data-scale in storage, designing an …

A high throughput B+ tree for SIMD architectures

W Zhang, Z Yan, Y Lin, C Zhao… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
B+ tree is one of the most important data structures and has been widely used in different
fields. With the increase of concurrent queries and data-scale in storage, designing an …

A survey on hybrid transactional and analytical processing

H Song, W Zhou, H Cui, X Peng, F Li - The VLDB Journal, 2024 - Springer
To provide applications with the ability to analyze fresh data and eliminate the time-
consuming ETL workflow, hybrid transactional and analytical (HTAP) systems have been …

Boosting Performance and QoS for Concurrent GPU B+ trees by Combining-Based Synchronization

W Zhang, C Zhao, L Peng, Y Lin, F Zhang… - Proceedings of the 28th …, 2023 - dl.acm.org
Concurrent B+ trees have been widely used in many systems. With the scale of data
requests increasing exponentially, the systems are facing tremendous performance …

A comprehensive performance evaluation of modern in-memory indices

Z Xie, Q Cai, G Chen, R Mao… - 2018 IEEE 34th …, 2018 - ieeexplore.ieee.org
Due to poor cache utilization and latching contention, the B-tree like structures, which have
been heavily used in traditional databases, are not suitable for modern in-memory …

Efficient and scalable cross-isa virtualization of hardware transactional memory

W Wang, PC Yew, A Zhai, S McCamant - Proceedings of the 18th ACM …, 2020 - dl.acm.org
System virtualization is a key enabling technology. However, existing virtualization
techniques suffer from a significant limitation due to their limited cross-ISA support for …

A fast sparse triangular solver for structured-grid problems on sunway many-core processor SW26010

X Wang, P Xu, W Xue, Y Ao, C Yang, H Fu… - Proceedings of the 47th …, 2018 - dl.acm.org
The sparse triangular solver (SpTRSV) is one of the most essential kernels in many scientific
and engineering applications. Efficiently parallelizing the SpTRSV on modern many-core …