Memory affinity for hierarchical shared memory multiprocessors CP Ribeiro, JF Méhaut, A Carissimi, M Castro, LG Fernandes 2009 21st International Symposium on Computer Architecture and High …, 2009 | 95 | 2009 |
Performance/energy trade-off in scientific computing: the case of ARM big.LITTLE and Intel Sandy Bridge EL Padoin, LL Pilla, M Castro, FZ Boito, POA Navaux, JF Méhaut IET Computers & Digital Techniques 9 (1), 27-35, 2015 | 61 | 2015 |
A machine learning-based approach for thread mapping on transactional memory applications M Castro, LFW Goes, CP Ribeiro, M Cole, M Cintra, JF Méhaut 2011 18th International Conference on High Performance Computing, 1-10, 2011 | 60 | 2011 |
On the energy efficiency and performance of irregular application executions on multicore, NUMA and manycore platforms E Francesquini, M Castro, PH Penna, F Dupros, HC Freitas, POA Navaux, ... Journal of Parallel and Distributed Computing 76, 32-48, 2015 | 52 | 2015 |
NUMA-ICTM: A parallel version of ICTM exploiting memory placement strategies for NUMA machines M Castro, LG Fernandes, C Pousa, JF Méhaut, MS de Aguiar 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-8, 2009 | 39 | 2009 |
Improving memory affinity of geophysics applications on NUMA platforms using Minas C Pousa Ribeiro, M Castro, JF Méhaut, A Carissimi High Performance Computing for Computational Science–VECPAR 2010: 9th …, 2011 | 34 | 2011 |
Seismic Wave Propagation Simulations on Low-power and Performance-centric Manycores M Castro, E Francesquini, F Dupros, H Aochi, POA Navaux, JF Méhaut Parallel Computing 54, 108-120, 2016 | 32 | 2016 |
A low-cost energy-efficient Raspberry Pi cluster for data mining algorithms J Saffran, G Garcia, MA Souza, PH Penna, M Castro, LFW Góes, ... European Conference on Parallel Processing, 788-799, 2016 | 30 | 2016 |
Saving energy by exploiting residual imbalances on iterative applications EL Padoin, M Castro, LL Pilla, POA Navaux, JF Méhaut 2014 21st International Conference on High Performance Computing (HiPC), 1-10, 2014 | 27 | 2014 |
Adaptive thread mapping strategies for transactional memory applications M Castro, LFW Góes, JF Méhaut Journal of Parallel and Distributed Computing 74, 2845-2859, 2014 | 24 | 2014 |
Performance improvement of stencil computations for multi-core architectures based on machine learning V Martínez, F Dupros, M Castro, P Navaux Procedia Computer Science 108, 305-314, 2017 | 22 | 2017 |
Analysis of computing and energy performance of multicore, NUMA, and manycore platforms for an irregular application M Castro, E Francesquini, TM Nguélé, JF Méhaut Proceedings of the 3rd Workshop on Irregular Applications: Architectures and …, 2013 | 22 | 2013 |
CAP Bench: a benchmark suite for performance and energy evaluation of low‐power many‐core processors MA Souza, PH Penna, MM Queiroz, AD Pereira, LFW Góes, HC Freitas, ... Concurrency and Computation: Practice and Experience 29 (4), 1-18, 2017 | 21 | 2017 |
High performance XSL-FO rendering for variable data printing F Giannetti, LG Fernandes, R Timmers, T Nunes, M Raeder, M Castro Proceedings of the 2006 ACM symposium on Applied computing, 811-817, 2006 | 19 | 2006 |
Dynamic Thread Mapping Based on Machine Learning for Transactional Memory Applications M Castro, L Góes, L Fernandes, JF Méhaut International European Conference on Parallel and Distributed Computing …, 2012 | 17 | 2012 |
A Comprehensive Performance Evaluation of the BinLPT Workload-Aware Loop Scheduler PH Penna, AT Gomes, M Castro, P Plentz, H Freitas, F Broquedis, ... Concurrency and Computation: Practice and Experience, 2019 | 16 | 2019 |
Energy Efficient Seismic Wave Propagation Simulation on a Low-power Manycore Processor M Castro, F Dupros, E Francesquini, JF Méhaut, POA Navaux International Symposium on Computer Architecture and High Performance …, 2014 | 15 | 2014 |
Real-time video denoising on multicores and GPUs with Kalman-based and bilateral filters fusion SG Pfleger, PDM Plentz, RCO Rocha, AD Pereira, M Castro Journal of Real-Time Image Processing 16 (5), 1629-1642, 2019 | 13 | 2019 |
BinLPT: A novel workload-aware loop scheduler for irregular parallel loops PH Penna, M Castro, P Plentz, HC Freitas, F Broquedis, JF Méhaut XVIII Simpósio em Sistemas Computacionais de Alto Desempenho-WSCAD, 2017 | 13 | 2017 |
Automatic Skeleton-Driven Memory Affinity for Transactional Worklist Applications LFW Góes, CP Ribeiro, M Castro, JF Méhaut, M Cole, M Cintra International Journal of Parallel Programming, 365-382, 2014 | 13 | 2014 |