Design for a Soft Error Resilient Dynamic Task-based Runtime C Cao, T Herault, G Bosilca, J Dongarra Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE …, 2015 | 43 | 2015 |
Unified Development for Mixed Multi-GPU and Multi-coprocessor Environments Using a Lightweight Runtime Environment A Haidar, C Cao, A YarKhan, P Luszczek, S Tomov, K Kabir, J Dongarra IPDPS '14 Proceedings of the 2014 IEEE 28th International Parallel and …, 2014 | 39 | 2014 |
clMAGMA: High performance dense linear algebra with OpenCL C Cao, J Dongarra, P Du, M Gates, P Luszczek, S Tomov 1st International Workshop on OpenCL (IWOCL), 2013 | 39 | 2013 |
Video copy detection based on speeded up robust features and locality sensitive hashing Z Zhang, C Cao, R Zhang, J Zou 2010 IEEE International Conference on Automation and Logistics, 13-18, 2010 | 33 | 2010 |
Performance and portability with opencl for throughput-oriented hpc workloads across accelerators, coprocessors, and multicore processors C Cao, M Gates, A Haidar, P Luszczek, S Tomov, I Yamazaki, J Dongarra 2014 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale …, 2014 | 14 | 2014 |
Software combining to mitigate multithreaded MPI contention A Amer, C Archer, M Blocksome, C Cao, M Chuvelev, H Fujita, ... Proceedings of the ACM International Conference on Supercomputing, 367-379, 2019 | 12 | 2019 |
Flexible linear algebra development and scheduling with cholesky factorization A Haidar, A YarKhan, C Cao, P Luszczek, S Tomov, J Dongarra 2015 IEEE 17th International Conference on High Performance Computing and …, 2015 | 10 | 2015 |
Implementing a high-performance recommendation system using Phoenix++ C Cao, F Song, DG Waddington 8th International Conference for Internet Technology and Secured …, 2013 | 8 | 2013 |
Video copy detection based on temporal features of key frames Z Zhang, R Zhang, C Cao 2010 International Conference on Artificial Intelligence and Education …, 2010 | 5 | 2010 |
Efficient implementation of MPI-3 RMA over openFabrics interfaces H Fujita, C Cao, S Sur, C Archer, E Paulson, M Garzaran Parallel Computing 87, 1-10, 2019 | 2 | 2019 |
Extensions of Task-based Runtime for High Performance Dense Linear Algebra Applications C Cao | | 2017 |
A Distributed Phoenix++ Framework for Big Data Recommendation Systems C Cao, F Song, DG Waddington International Journal of Intelligent Computing Research (IJICR) 5 (1/2), 422-429, 2014 | | 2014 |