The High Time Resolution Universe survey – XIV. Discovery of 23 pulsars through GPU-accelerated reprocessing

V Morello, ED Barr, S Cooper, M Bailes… - Monthly Notices of …, 2019 - academic.oup.com
We have performed a new search for radio pulsars in archival data of the intermediate and
high Galactic latitude parts of the Southern High Time Resolution Universe pulsar survey …

Compiling generalized histograms for GPU

T Henriksen, S Hellfritzsch… - … Conference for High …, 2020 - ieeexplore.ieee.org
We present and evaluate an implementation technique for histogram-like computations on
GPUs that ensures both work-efficient asymptotic cost, support for arbitrary associative and …

An optimized approach to histogram computation on GPU

J Gómez-Luna, JM González-Linares… - Machine Vision and …, 2013 - Springer
A histogram is a compact representation of the distribution of data in an image with a full
range of applications in diverse fields. Histogram generation is an inherently sequential …

Introducing 'Bones': a parallelizing source-to-source compiler based on algorithmic skeletons

C Nugteren, H Corporaal - Proceedings of the 5th Annual Workshop on …, 2012 - dl.acm.org
Recent advances in multi-core and many-core processors require programmers to exploit
an increasing amount of parallelism from their applications. Data parallel languages such as …

Large-scale fingerprint identification on GPU

R Cappelli, M Ferrara, D Maltoni - Information Sciences, 2015 - Elsevier
This paper proposes a new parallel algorithm to speed up fingerprint identification using
GPUs. A careful design of the algorithm and data structures, guided by well-defined …

Performance modeling of atomic additions on GPU scratchpad memory

J Gómez-Luna, JM González-Linares… - … on Parallel and …, 2012 - ieeexplore.ieee.org
GPU application implementations using scatter approaches will fall into write contention due
to atomic updates of output elements, if these result from more than one input element …

Fast Hough transform on GPUs: Exploration of algorithm trade-offs

GJ Van den Braak, C Nugteren, B Mesman… - Advanced Concepts for …, 2011 - Springer
The Hough transform is a commonly used algorithm to detect lines and other features in
images. It is robust to noise and occlusion, but has a large computational cost. This paper …

Massive atomics for massive parallelism on GPUs

IJ Egielski, J Huang, EZ Zhang - ACM SIGPLAN Notices, 2014 - dl.acm.org
One important type of parallelism exploited in many applications is reduction type
parallelism. In these applications, the order of the read-modify-write updates to one shared …

Database challenges for exploratory computing

M Buoncristiano, G Mecca, E Quintarelli, M Roveri… - ACM SIGMOD …, 2015 - dl.acm.org
Helping users to make sense of very big datasets is nowadays considered an important
research topic. However, the tools that are available for data analysis purposes typically …

GPU multisplit

S Ashkiani, A Davidson, U Meyer, JD Owens - ACM SIGPLAN Notices, 2016 - dl.acm.org
Multisplit is a broadly useful parallel primitive that permutes its input data into contiguous
buckets or bins, where the function that categorizes an element into a bucket is provided by …