heFFTe: Highly Efficient FFT for Exascale A Ayala, S Tomov, A Haidar, J Dongarra International Conference on Computational Science, 262-275, 2020 | 60 | 2020 |
Impacts of multi-gpu mpi collective communications on large fft computation A Ayala, S Tomov, X Luo, H Shaeik, A Haidar, G Bosilca, J Dongarra 2019 IEEE/ACM Workshop on Exascale MPI (ExaMPI), 12-18, 2019 | 31 | 2019 |
Scalability issues in FFT computation A Ayala, S Tomov, M Stoyanov, J Dongarra Parallel Computing Technologies: 16th International Conference, PaCT 2021 …, 2021 | 19 | 2021 |
Interim report on benchmarking FFT libraries on high performance systems A Ayala, S Tomov, P Luszczek, S Cayrols, G Ragghianti, J Dongarra University of Tennessee, ICL Tech Report ICLUT-21-03 7, 2021, 2021 | 15 | 2021 |
Autonomous vehicle control using a deep neural network and jetson nano R Febbo, B Flood, J Halloy, P Lau, K Wong, A Ayala Practice and Experience in Advanced Research Computing, 333-338, 2020 | 11 | 2020 |
Gpudirect mpi communications and optimizations to accelerate ffts on exascale systems H Shaiek, S Tomov, A Ayala, A Haidar, J Dongarra Extended Abstract icl-ut-19-06 9, 2019, 2019 | 11 | 2019 |
Design and Implementation for FFT-ECP on\\Distributed Accelerated Systems}} S Tomov, A Haidar, A Ayala, D Schultz, J Dongarra Network 1 (44.6), 84, 2019 | 11 | 2019 |
Accelerating multi-process communication for parallel 3-D FFT A Ayala, S Tomov, M Stoyanov, A Haidar, J Dongarra 2021 Workshop on Exascale MPI (ExaMPI), 46-53, 2021 | 7 | 2021 |
Linear-time CUR approximation of BEM matrices A Ayala, X Claeys, L Grigori Journal of Computational and Applied Mathematics 368, 112528, 2020 | 7 | 2020 |
FFT-ECP API and high-performance library prototype for 2-D and 3-D FFTs on large-scale heterogeneous systems with GPUs S Tomov, A Ayala, A Haidar, J Dongarra ECP WBS 2 (3.13), 2020-01, 2020 | 7 | 2020 |
FFT-ECP Implementation optimizations and features phase S Tomov, A Haidar, A Ayala, H Shaiek, J Dongarra Tech. Rep. ICL-UT-19-12 (2019-10 2019), 2019 | 7 | 2019 |
Local multiple traces formulation for electromagnetics: Stability and preconditioning for smooth geometries A Ayala, X Claeys, P Escapil-Inchauspé, C Jerez-Hanckes Journal of Computational and Applied Mathematics 413, 114356, 2022 | 6 | 2022 |
FFT benchmark performance experiments on systems targeting exascale A Ayala, S Tomov, P Luszczek, S Cayrols, G Ragghianti, J Dongarra Univ. Tennessee at Knoxville, Knoxville, TN, USA, Tech. Rep. ICL-UT-22–02, 2022 | 6 | 2022 |
International Conference on Computational Science A Ayala, S Tomov, A Haidar, J Dongarra Springer,, 2020 | 6 | 2020 |
Performance analysis of parallel FFT on large multi-GPU systems A Ayala, S Tomov, M Stoyanov, A Haidar, J Dongarra 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 5 | 2022 |
ALORA: Affine low-rank approximations A Ayala, X Claeys, L Grigori Journal of Scientific Computing 79, 1135-1160, 2019 | 5 | 2019 |
Advances in mixed precision algorithms: 2021 edition A Abdelfattah, H Anzt, A Ayala, E Boman, E Carson, S Cayrols, T Cojean, ... Lawrence Livermore National Lab.(LLNL), Livermore, CA (United States), 2021 | 4 | 2021 |
Lossy all-to-all exchange for accelerating parallel 3-D FFTs on hybrid architectures with GPUs S Cayrols, J Li, G Bosilca, S Tomov, A Ayala, J Dongarra 2022 IEEE International Conference on Cluster Computing (CLUSTER), 152-160, 2022 | 2 | 2022 |
Closed form inverse of local multi-trace operators A Ayala, X Claeys, V Dolean, MJ Gander Domain Decomposition Methods in Science and Engineering XXIII, 107-115, 2017 | 2 | 2017 |
Accelerating N-queens problem using OpenMP A Ayala, H Osman, D Shapiro, JM Desmarais, J Parri, M Bolic, V Groza 2011 6th IEEE International Symposium on Applied Computational Intelligence …, 2011 | 2 | 2011 |