MF Kaloorazi, RC de Lamare - IEEE Transactions on Signal …, 2018 - ieeexplore.ieee.org
An efficient, accurate, and reliable approximation of a matrix by one of lower rank is a fundamental task in numerical linear algebra and signal processing applications. In this …
In recent years approximate computing has been extensively explored as a paradigm to design hardware and software solutions that save energy by trading off on the quality of the …
J Lüttgau, M Kuhn, K Duwe, Y Alforov… - Supercomputing …, 2018 - centaur.reading.ac.uk
In current supercomputers, storage is typically provided by parallel distributed file systems for hot data and tape archives for cold data. These file systems are often compatible with …
M Rapp, M Sagi, A Pathania… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Two factors primarily affect the performance of multi-threaded tasks on many-core processors with logically-shared and physically-distributed Last-Level Cache (LLC): the LLC …
MF Kaloorazi, RC de Lamare - IEEE Journal of Selected Topics …, 2018 - ieeexplore.ieee.org
Low-rank matrix approximations play a fundamental role in numerical linear algebra and signal processing applications. This paper introduces a novel rank-revealing matrix …
Parallel applications performance strongly depends on the number of resources. Although adding new nodes usually reduces execution time, excessive amounts are often detrimental …
A massively parallel implementation of a recently developed technique for numerically integrating the transport equation, The Random Ray Method (TRRM)(Tramm et al., 2017), is …
The multiplication of a sparse matrix with a dense vector (SpMV) is a key component in many numerical schemes and its performance is known to be severely limited by main …
P Yellu, Z Zhang, MMR Monjur… - 2019 IEEE High …, 2019 - ieeexplore.ieee.org
High-performance computing (HPC) systems rely on new technologies such as emerging devices, advanced integration techniques, and computing architecture to continue …