A survey on optimized implementation of deep learning models on the nvidia jetson platform

S Mittal - Journal of Systems Architecture, 2019 - Elsevier
Abstract Design of hardware accelerators for neural network (NN) applications involves
walking a tight rope amidst the constraints of low-power, high accuracy and throughput …

Energy consumption prediction using machine learning; a review

A Mosavi, A Bahmani - 2019 - preprints.org
Abstract Machine learning (ML) methods has recently contributed very well in the
advancement of the prediction models used for energy consumption. Such models highly …

SpaceA: Sparse matrix vector multiplication on processing-in-memory accelerator

X Xie, Z Liang, P Gu, A Basak, L Deng… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Sparse matrix-vector multiplication (SpMV) is an important primitive across a wide range of
application domains such as scientific computing and graph analytics. Due to its intrinsic …

Accelerator for sparse-dense matrix multiplication

S Narayanamoorthy, NR Satish, A Suprun… - US Patent …, 2020 - Google Patents
Disclosed embodiments relate to an accelerator for sparse dense matrix instructions. In one
example, a processor to execute a sparse-dense matrix multiplication instruction, includes …

Via: A smart scratchpad for vector units with application to sparse matrix computations

J Pavon, IV Valdivieso, A Barredo… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Sparse matrix operations are critical kernels in multiple application domains such as High
Performance Computing, artificial intelligence and big data. Vector processing is widely …

IoT‐Based Response Time Analysis of Messages for Smart Autonomous Collision Avoidance System Using Controller Area Network

AK Biswal, D Singh, BK Pattanayak… - Wireless …, 2022 - Wiley Online Library
Many accidents and serious problems occur on the road due to the rapid increase in traffic
congestion in all sections of the country. Autonomous vehicles provide a solution to …

High-performance reconfigurable DNN accelerator on a bandwidth-limited embedded system

X Hu, H Huang, X Li, X Zheng, Q Ren, J He… - ACM Transactions on …, 2023 - dl.acm.org
Deep convolutional neural networks (DNNs) have been widely used in many applications,
particularly in machine vision. It is challenging to accelerate DNNs on embedded systems …

Layerwise sparse coding for pruned deep neural networks with extreme compression ratio

X Liu, W Li, J Huo, L Yao, Y Gao - Proceedings of the AAAI Conference on …, 2020 - aaai.org
Deep neural network compression is important and increasingly developed especially in
resource-constrained environments, such as autonomous drones and wearable devices …

Accelerating binarized convolutional neural networks with dynamic partial reconfiguration on disaggregated FPGAs

P Skrimponis, E Pissadakis, N Alachiotis… - Parallel Computing …, 2020 - ebooks.iospress.nl
Abstract Convolutional Neural Networks (CNNs) currently dominate the fields of artificial
intelligence and machine learning due to their high accuracy. However, their computational …

Enabling high performance deep learning networks on embedded systems

Q Li, Q Xiao, Y Liang - IECON 2017-43rd Annual Conference of …, 2017 - ieeexplore.ieee.org
Deep learning is nowadays one of the most popular research topics in computer science. In
recent years, the extensive application of convolutional neural network has made it become …