Novel computer architectures and quantum chemistry

MS Gordon, G Barca, SS Leang, D Poole… - The Journal of …, 2020 - ACS Publications
Electronic structure theory (especially quantum chemistry) has thrived and has become
increasingly relevant to a broad spectrum of scientific endeavors as the sophistication of …

Performance Monitoring and Controlling of Multicore Shared-Memory Parallel Processing Systems

LM Haji, SRM Zeebaree, ZS Ageed… - 2022 3rd Information …, 2022 - ieeexplore.ieee.org
The field of research known as “parallel processing” examines architectural and algorithmic
methods for increasing the efficiency and other desirable qualities (such as cost …

Performance Monitoring for Processes and Threads Execution-Controlling

LM Haji, SRM Zeebaree, OM Ahmed… - … on Communication & …, 2021 - ieeexplore.ieee.org
Strong parallelism can minimize computation time whilst increasing the cost of
synchronization. It's vital to keep track of how processes and threads are working. It is …

Load balancing in a heterogeneous world: CPU-Xeon Phi co-execution of data-parallel kernels

R Nozal, B Perez, JL Bosque, R Beivide - The Journal of Supercomputing, 2019 - Springer
Heterogeneous systems composed by a CPU and a set of different hardware accelerators
are very compelling thanks to their excellent performance and energy consumption features …

COWS for High Performance: Cost Aware Work Stealing for Irregular Parallel Loop

P Mishra, VK Nandivada - ACM Transactions on Architecture and Code …, 2024 - dl.acm.org
Parallel libraries such as OpenMP distribute the iterations of parallel-for-loops among the
threads, using a programmer-specified scheduling policy. While the existing scheduling …

Scalable suffix sorting on a multicore machine

JY Xie, G Nong, B Lao, W Xu - IEEE Transactions on Computers, 2020 - ieeexplore.ieee.org
A number of methods have been proposed for suffix sorting on internal memory of RAM and
external memory of hard disks. The current best results for suffix sorting on internal or …

Parallel programming models and paradigms: Openmp analysis

A Alrawais - 2021 5th International Conference on Computing …, 2021 - ieeexplore.ieee.org
The increase demand for processing power has grown over the years, this demand lend to
the parallel approach which means linking a bunch of computers together to jointly increase …

Exploring OpenMP GPU Offloading for Implementing Convolutional Neural Networks

K Yan, Y Shi, Y Yan - Proceedings of the 14th International Workshop on …, 2023 - dl.acm.org
Computing on heterogeneous architecture involving CPUs and accelerators is now a
popular choice of parallel computing. As a directive-based programming model, OpenMP …

Efficient parallelisation of the packet classification algorithms on multi‐core central processing units using multi‐threading application program interfaces

M Abbasi, M Rafiee - IET Computers & Digital Techniques, 2020 - Wiley Online Library
The categorisation of network packets according to multiple parameters such as sender and
receiver addresses is called packet classification. Packet classification lies at the core of …

UPIR: Toward the Design of Unified Parallel Intermediate Representation for Parallel Programming Models

A Wang, X Yi, Y Yan - Proceedings of the International Conference on …, 2022 - dl.acm.org
The complexity of heterogeneous computing architectures, as well as the demand for
productive and portable parallel application development, have driven the evolution of …