GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review

NK Pikle, SR Sathe, AY Vyavhare - Sādhanā, 2018 - Springer
Parallelization of the finite-element method (FEM) has been contemplated by the scientific
and high-performance computing community for over a decade. Most of the computations in …

An automatic user-adapted physical activity classification method using smartphones

P Li, Y Wang, Y Tian, TS Zhou… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
In recent years, an increasing number of people have become concerned about their health.
Most chronic diseases are related to lifestyle, and daily activity records can be used as an …

Cross-loop optimization of arithmetic intensity for finite element local assembly

F Luporini, AL Varbanescu, F Rathgeber… - ACM Transactions on …, 2015 - dl.acm.org
We study and systematically evaluate a class of composable code transformations that
improve arithmetic intensity in local assembly operations, which represent a significant …

[HTML][HTML] Numerical integration on GPUs for higher order finite elements

K Banaś, P Płaszewski, P Macioł - Computers & Mathematics with …, 2014 - Elsevier
The paper considers the problem of implementation on graphics processors of numerical
integration routines for higher order finite element approximations. The design of suitable …

Finite element numerical integration for first order approximations on multi-and many-core architectures

K Banaś, F Krużel, J Bielański - Computer Methods in Applied Mechanics …, 2016 - Elsevier
The paper presents investigations on the performance of the finite element numerical
integration algorithm for first order approximations and three processor architectures …

Multivariate normal maximum likelihood with both ordinal and continuous variables, and data missing at random

JN Pritikin, TR Brick, MC Neale - Behavior Research Methods, 2018 - Springer
A novel method for the maximum likelihood estimation of structural equation models (SEM)
with both ordinal and continuous indicators is introduced using a flexible multivariate probit …

Parallelized implementation of an explicit finite element method in many integrated core (MIC) architecture

Y Cai, G Li, W Liu - Advances in Engineering Software, 2018 - Elsevier
Hardware accelerators are becoming increasingly important in boosting high performance
computing systems. In this study, we develop a parallel explicit finite element (FE) analysis …

Exploration of OpenCL heterogeneous programming for porting solidification modeling to CPU‐GPU platforms

K Halbiniak, L Szustak, T Olas… - Concurrency and …, 2021 - Wiley Online Library
This article provides a comprehensive study of OpenCL heterogeneous programming for
porting applications to CPU–GPU computing platforms, with a real‐life application for the …

Fast GPU integration algorithm for isogeometric finite element method solvers using task dependency graphs

M Woźniak - Journal of Computational Science, 2015 - Elsevier
This article analyzes the integration for isogeometric finite element method solvers. In
particular, it shows that isogeometric solvers with higher order B-splines spend significant …

OpenCL performance portability for Xeon Phi coprocessor and NVIDIA GPUs: A case study of finite element numerical integration

K Banaś, F Krużel - Euro-Par 2014: Parallel Processing Workshops: Euro …, 2014 - Springer
We present the performance analysis of OpenCL kernels for three recently introduced many-
core accelerator architectures: Intel Xeon Phi coprocessor and NVIDIA Kepler and Fermi …