Syncron: Efficient synchronization support for near-data-processing architectures

C Giannoula, N Vijaykumar… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Near-Data-Processing (NDP) architectures present a promising way to alleviate data
movement costs and can provide significant performance and energy benefits to parallel …

Glocks: Efficient support for highly-contended locks in many-core cmps

JL Abell, J Fern, ME Acacio - 2011 IEEE International Parallel …, 2011 - ieeexplore.ieee.org
Synchronization is of paramount importance to exploit thread-level parallelism on many-core
CMPs. In these architectures, synchronization mechanisms usually rely on shared variables …

Efficient hardware barrier synchronization in many-core cmps

JL Abellán, J Fernández… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
Traditional software-based barrier implementations for shared memory parallel machines
tend to produce hotspots in terms of memory and network contention as the number of …

MiSAR: Minimalistic synchronization accelerator with resource overflow management

CK Liang, M Prvulovic - ACM SIGARCH Computer Architecture News, 2015 - dl.acm.org
While numerous hardware synchronization mechanisms have been proposed, they either
no longer function or suffer great performance loss when their hardware resources are …

Accelerating Irregular Applications via Efficient Synchronization and Data Access Techniques

C Giannoula - arXiv preprint arXiv:2211.05908, 2022 - arxiv.org
Irregular applications comprise an increasingly important workload domain for many fields,
including bioinformatics, chemistry, physics, social sciences and machine learning …

Paths to fast barrier synchronization on the node

C Hetland, G Tziantzioulis, B Suchy… - Proceedings of the 28th …, 2019 - dl.acm.org
Synchronization primitives like barriers heavily impact the performance of parallel programs.
As core counts increase and granularity decreases, the value of enabling fast barriers …

Notifying Memories for Dataflow Applications on Shared-Memory Parallel Computer

A Ghasemi - 2022 - theses.hal.science
Symmetric Shared-memory multiprocessor~(SMP) is the most widely used implementation
of high-performance multi-core processors. It offers a uniform shared memory view that …

Optimized communication and synchronization for embedded multiprocessors using ASIP methodology

H Xiao, T Isshiki, D Li, H Kunieda, Y Nakase… - Information and Media …, 2012 - jstage.jst.go.jp
Inter-processor communication and synchronization are critical problems in embedded
multiprocessors. In order to achieve high-speed communication and low-latency …

DynAMO: Improving Parallelism Through Dynamic Placement of Atomic Memory Operations

V Soria-Pardos, A Armejach, T Mück… - Proceedings of the 50th …, 2023 - dl.acm.org
With increasing core counts in modern multi-core designs, the overhead of synchronization
jeopardizes the scalability and efficiency of parallel applications. To mitigate these …

Design of a collective communication infrastructure for barrier synchronization in cluster-based nanoscale MPSoCs

JL Abellán, J Fernández, ME Acacio… - … , Automation & Test …, 2012 - ieeexplore.ieee.org
Barrier synchronization is a key programming primitive for shared memory embedded
MPSoCs. As the core count increases, software implementations cannot provide the needed …