[HTML][HTML] Computing platforms for big biological data analytics: perspectives and challenges

Z Yin, H Lan, G Tan, M Lu, AV Vasilakos… - Computational and …, 2017 - Elsevier
The last decade has witnessed an explosion in the amount of available biological sequence
data, due to the rapid progress of high-throughput sequencing projects. However, the …

Wav2letter++: A fast open-source speech recognition system

V Pratap, A Hannun, Q Xu, J Cai, J Kahn… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
This paper introduces wav2letter++, a fast open-source deep learning speech recognition
framework. wav2letter++ is written entirely in C++, and uses the ArrayFire tensor library for …

Effective extensible programming: unleashing Julia on GPUs

T Besard, C Foket, B De Sutter - IEEE Transactions on Parallel …, 2018 - ieeexplore.ieee.org
GPUs and other accelerators are popular devices for accelerating compute-intensive,
parallelizable applications. However, programming these devices is a difficult task. Writing …

PyPose: A library for robot learning with physics-based optimization

C Wang, D Gao, K Xu, J Geng, Y Hu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Deep learning has had remarkable success in robotic perception, but its data-centric nature
suffers when it comes to generalizing to ever-changing environments. By contrast, physics …

Voodoo-a vector algebra for portable database performance on modern hardware

H Pirk, O Moll, M Zaharia, S Madden - Proceedings of the VLDB …, 2016 - dl.acm.org
In-memory databases require careful tuning and many engineering tricks to achieve good
performance. Such database performance engineering is hard: a plethora of data and …

Flashlight: Enabling innovation in tools for machine learning

JD Kahn, V Pratap, T Likhomanenko… - International …, 2022 - proceedings.mlr.press
As the computational requirements for machine learning systems and the size and
complexity of machine learning frameworks increases, essential framework innovation has …

Efficient top-k query processing on massively parallel hardware

A Shanbhag, H Pirk, S Madden - Proceedings of the 2018 International …, 2018 - dl.acm.org
A common operation in many data analytics workloads is to find the top-k items, ie, the
largest or smallest operations according to some sort order (implemented via LIMIT or …

Automated translation and accelerated solving of differential equations on multiple GPU platforms

U Utkarsh, V Churavy, Y Ma, T Besard… - Computer Methods in …, 2024 - Elsevier
We demonstrate a high-performance vendor-agnostic method for massively parallel solving
of ensembles of ordinary differential equations (ODEs) and stochastic differential equations …

Multi-GPU system design with memory networks

G Kim, M Lee, J Jeong, J Kim - 2014 47th Annual IEEE/ACM …, 2014 - ieeexplore.ieee.org
GPUs are being widely used to accelerate different workloads and multi-GPU systems can
provide higher performance with multiple discrete GPUs interconnected together. However …

A 3D ray traced biological neural network learning model

B Yuen, X Dong, T Lu - Nature Communications, 2024 - nature.com
Training large neural networks on big datasets requires significant computational resources
and time. Transfer learning reduces training time by pre-training a base model on one …