Task-based FMM for multicore architectures E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Takahashi SIAM Journal on Scientific Computing 36 (1), C66-C93, 2014 | 95 | 2014 |
Task‐based FMM for heterogeneous architectures E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Takahashi Concurrency and Computation: Practice and Experience 28 (9), 2608-2629, 2016 | 63 | 2016 |
A novel hybrid quicksort algorithm vectorized using AVX-512 on Intel Skylake B Bramas arXiv preprint arXiv:1704.08579, 2017 | 47 | 2017 |
Bridging the gap between OpenMP and task-based runtime systems for the fast multipole method E Agullo, O Aumage, B Bramas, O Coulaud, S Pitoiset IEEE Transactions on Parallel and Distributed Systems 28 (10), 2794-2807, 2017 | 35 | 2017 |
Fast sorting algorithms using AVX-512 on Intel Knights Landing B Bramas arXiv preprint arXiv:1704.08579 305, 315, 2017 | 24 | 2017 |
ScalFMM: A Generic Parallel Fast Multipole Library P Blanchard, B Bramas, O Coulaud, E Darve, L Dupuy, A Etcheverry, ... Computational Science and Engineering (CSE), 2015 | 23 | 2015 |
Optimized M2L kernels for the Chebyshev interpolation based fast multipole method M Messner, B Bramas, O Coulaud, E Darve arXiv preprint arXiv:1210.7292, 2012 | 22 | 2012 |
An integral equation formulation of the N-body dielectric spheres problem. Part II: complexity analysis B Bramas, M Hassan, B Stamm ESAIM: Mathematical Modelling and Numerical Analysis 55, S625-S651, 2021 | 17* | 2021 |
Computing the sparse matrix vector product using block-based kernels without zero padding on processors with AVX-512 instructions B Bramas, P Kus PeerJ Computer Science 4, e151, 2018 | 16 | 2018 |
Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering B Bramas, A Ketterlin PeerJ Computer Science 6, e247, 2020 | 15 | 2020 |
Optimization and parallelization of the boundary element method for the wave equation in time domain B Bramas Bordeaux, 2016 | 15 | 2016 |
Pipelining the fast multipole method over a runtime system E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Toru arXiv preprint arXiv:1206.0115, 2012 | 15 | 2012 |
Impact study of data locality on task-based applications through the Heteroprio scheduler B Bramas PeerJ Computer Science 5, e190, 2019 | 11 | 2019 |
Matrices over runtime systems at exascale E Agullo, G Bosilca, B Bramas, C Castagnede, O Coulaud, E Darve, ... 2012 SC Companion: High Performance Computing, Networking Storage and …, 2012 | 11 | 2012 |
Inastemp: A Novel Intrinsics‐as‐Template Library for Portable SIMD‐Vectorization B Bramas Scientific Programming 2017 (1), 5482468, 2017 | 10 | 2017 |
Task-based fast multipole method for clusters of multicore processors E Agullo, B Bramas, O Coulaud, M Khannouz, L Stanisic Inria Bordeaux Sud-Ouest, 2017 | 10 | 2017 |
A fast vectorized sorting implementation based on the ARM scalable vector extension (SVE) B Bramas PeerJ Computer Science 7, e769, 2021 | 9 | 2021 |
Increasing the degree of parallelism using speculative execution in task-based runtime systems B Bramas PeerJ Computer Science 5, e183, 2019 | 9 | 2019 |
Shape-and scale-dependent coupling between spheroids and velocity gradients in turbulence N Pujara, JA Arguedas-Leiva, CC Lalescu, B Bramas, M Wilczek Journal of Fluid Mechanics 922, R6, 2021 | 8 | 2021 |
Towards extreme scale technologies and accelerators for eurohpc hw/sw supercomputing applications for exascale: The textarossa approach G Agosta, M Aldinucci, C Alvarez, R Ammendola, Y Arfat, O Beaumont, ... Microprocessors and Microsystems 95, 104679, 2022 | 7 | 2022 |