Optical microcavities

KJ Vahala - Nature, 2003 - nature.com
Optical microcavities confine light to small volumes by resonant recirculation. Devices based
on optical microcavities are already indispensable for a wide range of applications and …

Partial control-flow linearization

S Moll, S Hack - ACM SIGPLAN Notices, 2018 - dl.acm.org
If-conversion is a fundamental technique for vectorization. It accounts for the fact that in a
SIMD program, several targets of a branch might be executed because of divergence …

Exploiting recent SIMD architectural advances for irregular applications

L Chen, P Jiang, G Agrawal - … of the 2016 International Symposium on …, 2016 - dl.acm.org
A broad class of applications involve indirect or data-dependent memory accesses and are
referred to as irregular applications. Recent developments in SIMD architectures …

Scalability analysis of AVX-512 extensions

JM Cebrian, L Natvig, M Jahre - The Journal of Supercomputing, 2020 - Springer
Energy efficiency below a specific thermal design power (TDP) has become the main design
goal for microprocessors across all market segments. Optimizing the usage of the available …

KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism

I El Hajj, J Gómez-Luna, C Li, LW Chang… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Dynamic parallelism on GPUs simplifies the programming of many classes of applications
that generate parallelizable work not known prior to execution. However, modern GPUs …

AAlign: A SIMD framework for pairwise sequence alignment on x86-based multi- and many-core processors

K Hou, H Wang, W Feng - 2016 IEEE International Parallel and …, 2016 - ieeexplore.ieee.org
Pairwise sequence alignment algorithms, e.g., Smith-Waterman and Needleman-Wunsch,
with adjustable gap penalty systems are widely used in bioinformatics. The strong data …

A static cut-off for task parallel programs

S Iwasaki, K Taura - Proceedings of the 2016 International Conference …, 2016 - dl.acm.org
Task parallel models supporting dynamic and hierarchical parallelism are believed to offer a
promising direction to achieving higher performance and programmability. Divide-and …

Conflict-free vectorization of associative irregular applications with recent SIMD architectural advances

P Jiang, G Agrawal - Proceedings of the 2018 International Symposium …, 2018 - dl.acm.org
Irregular applications that involve indirect memory accesses were traditionally considered
unsuitable for SIMD processing. Though some progress has been made in recent years, the …

MERCATOR: A GPGPU framework for irregular streaming applications

SV Cole, J Buhler - 2017 International Conference on High …, 2017 - ieeexplore.ieee.org
GPUs have a natural affinity for streaming applications exhibiting consistent, predictable
dataflow. However, many high-impact irregular streaming applications, including sequence …

Exploiting vector and multicore parallelism for recursive, data- and task-parallel programs

B Ren, S Krishnamoorthy, K Agrawal… - Proceedings of the 22nd …, 2017 - dl.acm.org
Modern hardware contains parallel execution resources that are well-suited for data
parallelism (vector units) and task parallelism (multicores). However, most work on parallel …