Optimization techniques for GPU programming

P Hijma, S Heldens, A Sclocco… - ACM Computing …, 2023 - dl.acm.org
In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …

Ride-hailing order dispatching at didi via reinforcement learning

Z Qin, X Tang, Y Jiao, F Zhang, Z Xu… - … Journal on Applied …, 2020 - pubsonline.informs.org
Order dispatching is instrumental to the marketplace engine of a large-scale ride-hailing
platform, such as the DiDi platform, which continuously matches passenger trip requests to …

Order-agnostic cross entropy for non-autoregressive machine translation

C Du, Z Tu, J Jiang - International conference on machine …, 2021 - proceedings.mlr.press
We propose a new training objective named order-agnostic cross entropy (OaXE) for fully
non-autoregressive translation (NAT) models. OaXE improves the standard cross-entropy …

Allox: compute allocation in hybrid clusters

TN Le, X Sun, M Chowdhury, Z Liu - Proceedings of the Fifteenth …, 2020 - dl.acm.org
Modern deep learning frameworks support a variety of hardware, including CPU, GPU, and
other accelerators, to perform computation. In this paper, we study how to schedule jobs …

Three-dimensional reconstruction of CT image features based on multi-threaded deep learning calculation

F Chen, K Muhammad, SH Wang - Pattern Recognition Letters, 2020 - Elsevier
Traditional technology uses serial processing method in CT image feature extraction. It is
prone to loss of image data, which causes problems such as ring distortion of the …

View-shuffled clustering via the modified Hungarian algorithm

W Dong, XJ Wu, T Xu, Z Feng, SAA Ahmed, M Awais… - Neural Networks, 2024 - Elsevier
In the majority of existing multi-view clustering methods, the prerequisite is that the data
have the correct cross-view correspondence. However, this strong assumption may not …

Assignment and Take-Off Approaches for Large-Scale Autonomous UAV Swarms

J Wubben, D Hernández, JM Cecilia… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
In the last decade, the popularity of UAVs has increased tremendously. Nowadays, many
researchers are interested in UAV swarms. Coordinating a swarm of UAVs is a complicated …

A Study of Performance Programming of CPU, GPU accelerated Computers and SIMD Architecture

X Yi - arXiv preprint arXiv:2409.10661, 2024 - arxiv.org
Parallel computing is a standard approach to achieving high-performance computing (HPC).
Three commonly used methods to implement parallel computing include: 1) applying …

Physicist's view on the unbalanced -cardinality assignment problem

P Koehl, H Orland - Physical Review E, 2024 - APS
The k-cardinality unbalanced assignment problem asks for assigning k “agents” to k “tasks”
on a one-to-one basis, while minimizing the total cost associated with the assignment, with …

Semantic feature matching for robust mapping in agriculture

M Qadri, G Kantor - arXiv preprint arXiv:2107.04178, 2021 - arxiv.org
Visual Simultaneous Localization and Mapping (SLAM) systems are an essential
component in agricultural robotics that enable autonomous navigation and the construction …