A Alhammad, R Pellizzoni - 2014 Design, Automation & Test in …, 2014 - ieeexplore.ieee.org
In multicore systems, contention for access to main memory between application threads complicates timing analysis and may lead to pessimistic bounds on execution time. This is …
TN Brooks - Intelligent Computing: Proceedings of the 2018 …, 2019 - Springer
Software is everywhere, from mission critical systems such as industrial power stations, pacemakers and even household appliances. This growing dependence on technology and …
We present HORNET, a parallel, highly configurable, cycle-level multicore simulator based on an ingress-queued worm-hole router NoC architecture. The parallel simulation engine …
D Lustig, C Trippel, M Pellauer… - Proceedings of the 42nd …, 2015 - dl.acm.org
Architectural heterogeneity is increasing: numerous products and studies have proven the benefits of combining cores and accelerators with varying ISAs into a single system …
Current memory reclamation mechanisms for highly-concurrent data structures present an awkward trade-off. Techniques such as epoch-based reclamation perform well when all …
D Rogora, A Carzaniga, A Diwan, M Hauswirth… - Proceedings of the …, 2020 - dl.acm.org
To understand, debug, and predict the performance of complex software systems, we develop the concept of probabilistic performance annotations. In essence, we annotate …
P Kang - Software: Practice and Experience, 2018 - Wiley Online Library
Function call interception (FCI), or method call interception (MCI) in the object‐oriented programming domain, is a technique of intercepting function calls at program runtime …
EHM da Cruz, MAZ Alves, A Carissimi… - … on Parallel and …, 2011 - ieeexplore.ieee.org
In parallel programs, the tasks of a given application must cooperate in order to accomplish the required computation. However, the communication time between the tasks may be …
F Zhou, Y Gan, S Ma, Y Wang - 13th USENIX Symposium on Operating …, 2018 - usenix.org
This paper tries to identify waiting events that limit the maximal throughput of a multi- threaded application. To achieve this goal, we not only need to understand an event's …