Parallelism-centric what-if and differential analyses

A Yoga, S Nagarakatte - Proceedings of the 40th ACM SIGPLAN …, 2019 - dl.acm.org
This paper proposes TaskProf2, a parallelism profiler and an adviser for task parallel
programs. As a parallelism profiler, TaskProf2 pinpoints regions with serialization …

Chunking loops with non-uniform workloads

IK Prabhu, VK Nandivada - Proceedings of the 34th ACM International …, 2020 - dl.acm.org
Task-parallel languages such as X10 implement dynamic lightweight task-parallel execution
model, where programmers are encouraged to express the ideal parallelism in the program …

The polyhedral model beyond loops recursion optimization and parallelization through polyhedral modeling

S Kobeissi, P Clauss - IMPACT 2019-9th International Workshop …, 2019 - inria.hal.science
There may be a huge gap between the statements outlined by programmers in a program
source code and instructions that are actually performed by a given processor architecture …

Rec2poly: Converting recursions to polyhedral optimized loops using an inspector-executor strategy

S Kobeissi, A Ketterlin, P Clauss - … , SAMOS 2020, Samos, Greece, July 5 …, 2020 - Springer
In this paper, we propose Rec2Poly, a framework which detects automatically if recursive
programs may be transformed into affine loops that are compliant with the polyhedral model …

Techniques for optimizing dynamic parallelism on graphics processing units

I El Hajj - 2018 - ideals.illinois.edu
Dynamic parallelism is a feature of general purpose graphics processing units (GPUs)
whereby threads running on a GPU can spawn other threads without CPU intervention. This …

Homeostasis: Design and Implementation of a Self-Stabilizing Compiler

A Nougrahiya, VK Nandivada - ACM Transactions on Programming …, 2024 - dl.acm.org
Mainstream compilers perform a multitude of analyses and optimizations on the given input
program. Each analysis (such as points-to analysis) may generate a program-abstraction …

[PDF][PDF] My Approach

S Gupta - gupta-suyash.github.io
My research focuses on designing high throughput and low latency yielding BFT consensus
protocols and decentralized systems, which can guard against crash and Byzantine failures …

Performance Profilers and Debugging Tools for OpenMp Applications

NB Moradi - 2021 - search.proquest.com
OpenMP is a popular application programming interface (API) used to write shared-memory
parallel programs. It supports a wide range of parallel constructs to express different types of …

Parallelism-Driven Performance Analysis Techniques for Task Parallel Programs

A Yoga - 2019 - search.proquest.com
Performance analysis of parallel programs continues to be challenging for programmers.
Programmers have to account for several factors to extract the best possible performance …