Computation-efficient deep learning for computer vision: A survey

Y Wang, Y Han, C Wang, S Song… - Cybernetics and …, 2024 - ieeexplore.ieee.org
Over the past decade, deep learning models have exhibited considerable advancements,
reaching or even exceeding human-level performance in a range of visual perception tasks …

MELTing point: Mobile Evaluation of Language Transformers

S Laskaridis, K Kateveas, L Minto… - arXiv preprint arXiv …, 2024 - arxiv.org
Transformers have revolutionized the machine learning landscape, gradually making their
way into everyday tasks and equipping our computers with``sparks of intelligence'' …

The future of consumer edge-ai computing

S Laskaridis, SI Venieris, A Kouris, R Li… - IEEE Pervasive …, 2024 - ieeexplore.ieee.org
In the last decade, deep learning has rapidly infiltrated the consumer end, mainly thanks to
hardware acceleration across devices. However, as we look toward the future, it is evident …

[HTML][HTML] ColabNAS: Obtaining lightweight task-specific convolutional neural networks following Occam's razor

AM Garavagno, D Leonardis, A Frisoli - Future Generation Computer …, 2024 - Elsevier
The current trend of applying transfer learning from convolutional neural networks (CNNs)
trained on large datasets can be an overkill when the target application is a custom and …

FPGA implementation of deep learning architecture for kidney cancer detection from histopathological images

S Lal, AK Chanchal, J Kini, GK Upadhyay - Multimedia Tools and …, 2024 - Springer
Kidney cancer is the most common type of cancer, and designing an automated system to
accurately classify the cancer grade is of paramount importance for a better prognosis of the …

On-device deep learning: survey on techniques improving energy efficiency of DNNs

A Boumendil, W Bechkit… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Providing high-quality predictions is no longer the sole goal for neural networks. As we live
in an increasingly interconnected world, these models need to match the constraints of …

DiMO-CNN: Deep Learning Toolkit-Accelerated Analytical Modeling and Optimization of CNN Hardware and Dataflow

J Song, R Liang, B Yuan, J Hu - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The growing complexity of CNNs demands both hardware acceleration design and dataflow
mapping solutions. The large co-design solution space presents a huge challenge. We …

OptimML: Joint Control of Inference Latency and Server Power Consumption for ML Performance Optimization

G Chen, X Wang - ACM Transactions on Autonomous and Adaptive …, 2024 - dl.acm.org
Power capping is an important technique for high-density servers to safely oversubscribe the
power infrastructure in a data center. However, power capping is commonly accomplished …

Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search

W Lou, L Gong, C Wang, J Qian… - … on Computer-Aided …, 2024 - ieeexplore.ieee.org
Recently, algorithm-hardware co-exploration for neural networks (NNs) has become the key
to obtaining high-quality solutions. However, previous efforts for FPGAs focus on neural …

Using the Abstract Computer Architecture Description Language to Model AI Hardware Accelerators

MM Müller, ARM Borst, K Lübeck, ALF Jung… - arXiv preprint arXiv …, 2024 - arxiv.org
Artificial Intelligence (AI) has witnessed remarkable growth, particularly through the
proliferation of Deep Neural Networks (DNNs). These powerful models drive technological …