Compiling a High-level Directive-Based Programming Model for GPGPUs X Tian, R Xu, Y Yan, Z Yun, S Chandrasekaran, B Chapman The 26th International Workshop on Languages and Compilers for Parallel …, 2013 | 59 | 2013 |
Nas parallel benchmarks for gpgpus using a directive-based programming model R Xu, X Tian, S Chandrasekaran, Y Yan, B Chapman Languages and Compilers for Parallel Computing: 27th International Workshop …, 2015 | 40 | 2015 |
Compiler transformation of nested loops for general purpose GPUs X Tian, R Xu, Y Yan, S Chandrasekaran, D Eachempati, B Chapman Concurrency and Computation: Practice and Experience 28 (2), 537-556, 2016 | 16 | 2016 |
Multi‐GPU support on single node using directive‐based programming model R Xu, X Tian, S Chandrasekaran, B Chapman Scientific Programming 2015 (1), 621730, 2015 | 16 | 2015 |
The OpenACC data model: Preliminary study on its major challenges and implementations M Wolfe, S Lee, J Kim, X Tian, R Xu, B Chapman, S Chandrasekaran Parallel Computing 78, 15-27, 2018 | 11 | 2018 |
Implementing the OpenACC data model M Wolfe, S Lee, J Kim, X Tian, R Xu, S Chandrasekaran, B Chapman 2017 IEEE International Parallel and Distributed Processing Symposium …, 2017 | 10 | 2017 |
Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations X Tian, D Khaldi, D Eachempati, R Xu, B Chapman 2016 45th International Conference on Parallel Processing (ICPP), 572 - 581, 2016 | 9 | 2016 |
OpenACC Parallelization and optimization of NAS parallel benchmarks R Xu, X Tian, S Chandrasekaran, Y Yan, B Chapman Proc. GPU Technol. Conf, 1-27, 2014 | 9 | 2014 |
OpenUH: open source OpenACC compiler X Tian, R Xu, B Chapman GTC2014, HPCTools Group Computer Science Department University of Houston, 2014 | 7 | 2014 |
Reduction operations in parallel loops for GPGPUs R Xu, X Tian, Y Yan, S Chandrasekaran, B Chapman Proceedings of Programming Models and Applications on Multicores and …, 2014 | 7 | 2014 |
Performance and power characteristics of matrix multiplication algorithms on multicore and shared memory machines Y Yan, J Kemp, X Tian, AM Malik, B Chapman 2012 SC Companion: High Performance Computing, Networking Storage and …, 2012 | 7 | 2012 |
Assessing one-to-one parallelism levels mapping for openmp offloading to gpus C Shen, X Tian, D Khaldi, B Chapman Proceedings of the 8th International Workshop on Programming Models and …, 2017 | 6 | 2017 |
An analytical model-based auto-tuning framework for locality-aware loop scheduling R Xu, S Chandrasekaran, X Tian, B Chapman High Performance Computing: 31st International Conference, ISC High …, 2016 | 6 | 2016 |
Acceleration of bulk memory operations in a heterogeneous multicore architecture JH Lee, Z Liu, X Tian, DH Woo, W Shi, D Boumber, Y Yan, KA Kwon Proceedings of the 21st international conference on Parallel architectures …, 2012 | 3 | 2012 |
A Compiler Optimization Framework for Directive-Based GPU Computing X Tian | | 2016 |