关注
Thomas B. Rolinger
Thomas B. Rolinger
在 nvidia.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Performance considerations for scalable parallel tensor decomposition
TB Rolinger, TA Simon, CD Krieger
Journal of Parallel and Distributed Computing 129, 83-98, 2019
152019
Impact of traditional sparse optimizations on a migratory thread architecture
TB Rolinger, CD Krieger
2018 IEEE/ACM 8th Workshop on Irregular Applications: Architectures and …, 2018
112018
Performance evaluation of parallel sparse tensor decomposition implementations
TB Rolinger, TA Simon, CD Krieger
2016 6th Workshop on Irregular Applications: Architecture and Algorithms …, 2016
102016
Performance challenges for heterogeneous distributed tensor decompositions
TB Rolinger, TA Simon, CD Krieger
2017 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2017
82017
Exploring parallel bitonic sort on a migratory thread architecture
K Velusamy, TB Rolinger, J McMahon, TA Simon
2018 IEEE High Performance extreme Computing Conference (HPEC), 1-7, 2018
72018
Optimizing data layouts for irregular applications on a migratory thread architecture
T Rolinger, C Krieger, A Sussman
2019 IEEE/ACM Workshop on Memory Centric High Performance Computing (MCHPC …, 2019
62019
Faster stochastic block partition using aggressive initial merging, compressed representation, and parallelism control
AJ Uppal, J Choi, TB Rolinger, HH Huang
2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2021
52021
Runtime optimizations for irregular applications in Chapel
TB Rolinger, CD Krieger, A Sussman
The 8th Annual Chapel Imple-menters and Users Workshop (CHIUW), 2021
42021
Parallel sparse tensor decomposition in chapel
TB Rolinger, TA Simon, CD Krieger
2018 IEEE International Parallel and Distributed Processing Symposium …, 2018
42018
Towards high productivity and performance for irregular applications in chapel
TB Rolinger, J Craft, CD Krieger, A Sussman
2021 SC Workshops Supplementary Proceedings (SCWS), 1-11, 2021
32021
Compiler Optimization for Irregular Memory Access Patterns in PGAS Programs
TB Rolinger, CD Krieger, A Sussman
International Workshop on Languages and Compilers for Parallel Computing, 3-21, 2022
22022
Optimizing memory-compute colocation for irregular applications on a migratory thread architecture
TB Rolinger, CD Krieger, A Sussman
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
22021
An empirical evaluation of allgatherv on multi-gpu systems
TB Rolinger, TA Simon, CD Krieger
2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2018
22018
Performance Strategies for Parallel Bitonic Sort on a Migratory Thread Architecture
K Velusamy, TB Rolinger, J McMahon
2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2020
12020
TLPGNN: A Lightweight Two-Level Parallelism Paradigm for Graph Neural Network Computation on Single and Multiple GPUs
Q Fu, Y Ji, T Rolinger, HH Huang
ACM Transactions on Parallel Computing 11 (2), 1-28, 2024
2024
JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication
Q Fu, TB Rolinger, HH Huang
2024 IEEE/ACM International Symposium on Code Generation and Optimization …, 2024
2024
Decontentioned Stochastic Block Partition
AJ Uppal, TB Rolinger, HH Huang
2023 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2023
2023
Compiler Optimizations for Irregular Memory Access Patterns in the PGAS Programming Model
TB Rolinger
University of Maryland, College Park, 2023
2023
2022 Future Computing Summer Internship: TenTS (Tensor Toolbox in Scala)
N Nguyen, L Vandecasteele, T Ranadive, T Rolinger
2022
Adaptive Prefetching for Fine-grain Communication in PGAS Programs
TB Rolinger, A Sussman
系统目前无法执行此操作,请稍后再试。
文章 1–20