Lightweight CNN architecture design for rolling bearing fault diagnosis

L Jiang, C Shi, H Sheng, X Li… - Measurement Science and …, 2024 - iopscience.iop.org
Rolling bearing is a key component of rotating machinery, and its fault diagnosis technology
is very important to ensure the safety of equipment. With the rapid development of deep …

[HTML][HTML] Split_ Composite: A Radar Target Recognition Method on FFT Convolution Acceleration

X Li, Y He, W Zhu, W Qu, Y Li, C Li, B Zhu - Sensors, 2024 - mdpi.com
Synthetic Aperture Radar (SAR) is renowned for its all-weather and all-time imaging
capabilities, making it invaluable for ship target recognition. Despite the advancements in …

TrIM, Triangular Input Movement Systolic Array for Convolutional Neural Networks: Architecture and Hardware Implementation

C Sestito, S Agwa, T Prodromakis - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
Modern hardware architectures for Convolutional Neural Networks (CNNs), other than
targeting high performance, aim at dissipating limited energy. Reducing the data movement …

Toward Efficient Retraining: A Large-Scale Approximate Neural Network Framework With Cross-Layer Optimization

T Yu, B Wu, K Chen, C Yan… - IEEE Transactions on Very …, 2024 - ieeexplore.ieee.org
Leveraging approximate multipliers in approximate neural networks (ApproxNNs) can
effectively reduce hardware area and power consumption, making them suitable for edge …

Eyelet: A Cross-Mesh NoC-Based Fine-Grained Sparse CNN Accelerator for Spatio-Temporal Parallel Computing Optimization

B Yao, L Liu, Y Peng, X Peng, R Xu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Fine-grained sparse convolutional neural networks (CNNs) achieve a better trade-off
between model accuracy and size than coarse-grained sparse CNNs. Due to irregular data …

An Efficient Sparse CNN Inference Accelerator With Balanced Intra-and Inter-PE Workload

J Guo, T Xu, Z Wu, H Xiao - IEEE Transactions on Very Large …, 2024 - ieeexplore.ieee.org
Sparse convolutional neural networks (SCNNs) which can prune trivial parameters in the
network while maintaining the model accuracy has been proved to be an attractive approach …

LPRE: Logarithmic Posit-enabled Reconfigurable edge-AI Engine

O Kokane, M Lokhande, G Raut, A Teman… - Authorea …, 2024 - techrxiv.org
Edge-AI applications face huge challenges in resource-constrained environments,
particularly in enhancing computational efficiency within bandwidth limitations. This work …

TrIM: Triangular Input Movement Systolic Array for Convolutional Neural Networks--Part II: Architecture and Hardware Implementation

C Sestito, S Agwa, T Prodromakis - arXiv preprint arXiv:2408.10243, 2024 - arxiv.org
Modern hardware architectures for Convolutional Neural Networks (CNNs), other than
targeting high performance, aim at dissipating limited energy. Reducing the data movement …

Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads

M Lokhande, G Raut, SK Vishvakarma - arXiv preprint arXiv:2412.11702, 2024 - arxiv.org
The rapid adaptation of data driven AI models, such as deep learning inference, training,
Vision Transformers (ViTs), and other HPC applications, drives a strong need for runtime …

AttnACQ: Attentioned-AutoCorrelation Based Query for Hyperdimensional Associative Memory

T Yu, B Wu, K Chen, G Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The low power and latency of hyperdimensional associative memory (HAM) promotes
hyperdimensional computing (HDC)'s efficiency. However, overheads of HAM can be hardly …