Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

FPGA HLS today: successes, challenges, and opportunities

J Cong, J Lau, G Liu, S Neuendorffer, P Pan… - ACM Transactions on …, 2022 - dl.acm.org
The year 2011 marked an important transition for FPGA high-level synthesis (HLS), as it
went from prototyping to deployment. A decade later, in this article, we assess the progress …

Unsupervised anomaly detection with LSTM autoencoders using statistical data-filtering

S Maleki, S Maleki, NR Jennings - Applied Soft Computing, 2021 - Elsevier
To address one of the most challenging industry problems, we develop an enhanced
training algorithm for anomaly detection in unlabelled sequential data such as time-series …

Deep neural network approximation for custom hardware: Where we've been, where we're going

E Wang, JJ Davis, R Zhao, HC Ng, X Niu… - ACM Computing …, 2019 - dl.acm.org
Deep neural networks have proven to be particularly effective in visual and audio
recognition tasks. Existing models tend to be computationally expensive and memory …

Flextensor: An automatic schedule exploration and optimization framework for tensor computation on heterogeneous system

S Zheng, Y Liang, S Wang, R Chen… - Proceedings of the Twenty …, 2020 - dl.acm.org
Tensor computation plays a paramount role in a broad range of domains, including machine
learning, data analytics, and scientific computing. The wide adoption of tensor computation …

FTRANS: energy-efficient acceleration of transformers using FPGA

B Li, S Pandey, H Fang, Y Lyv, J Li, J Chen… - Proceedings of the …, 2020 - dl.acm.org
In natural language processing (NLP), the" Transformer" architecture was proposed as the
first transduction model replying entirely on self-attention mechanisms without using …
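The self-attention mechanism this entry refers to can be sketched as scaled dot-product attention; the following is a minimal NumPy illustration of the math, not the FTRANS hardware mapping, and the projection matrices here are arbitrary placeholders:

```python
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a sequence x of shape (n, d)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])         # (n, n) pairwise similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v                              # weighted mix of value vectors

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))                     # toy sequence: 4 tokens, dim 8
w = [rng.standard_normal((8, 8)) for _ in range(3)]
out = self_attention(x, *w)
```

Every output token is a convex combination of all value vectors, which is why the architecture needs no recurrence or convolution.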

Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity

S Cao, C Zhang, Z Yao, W Xiao, L Nie, D Zhan… - Proceedings of the …, 2019 - dl.acm.org
Neural networks based on Long Short-Term Memory (LSTM) are widely deployed in latency-
sensitive language and speech applications. To speed up LSTM inference, previous …
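The bank-balanced sparsity named in this title can be sketched as follows; this is a toy software illustration under the assumption that each weight row is split into fixed-width banks and the same number of largest-magnitude entries is kept per bank, so that banks map evenly onto parallel hardware lanes (the bank width and keep count below are arbitrary):

```python
import numpy as np

def bank_balanced_prune(w, bank_size, keep):
    """Keep the `keep` largest-magnitude weights in every `bank_size`-wide bank.

    Each bank ends up with identical sparsity, which balances the load
    across parallel multiply-accumulate lanes.
    """
    out = np.zeros_like(w)
    banks = w.reshape(-1, bank_size)        # split rows into fixed banks
    res = out.reshape(-1, bank_size)        # view into the output array
    for b, r in zip(banks, res):
        top = np.argsort(np.abs(b))[-keep:] # indices of largest-magnitude entries
        r[top] = b[top]
    return out

w = np.arange(1.0, 17.0).reshape(4, 4)
sparse = bank_balanced_prune(w, bank_size=4, keep=2)
```

Unlike unstructured pruning, every bank has exactly the same number of nonzeros, so no lane sits idle waiting for a denser bank to finish.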

A high-speed and low-complexity architecture for softmax function in deep learning

M Wang, S Lu, D Zhu, J Lin… - 2018 IEEE Asia Pacific …, 2018 - ieeexplore.ieee.org
Recently, significant improvement has been achieved for hardware architecture design of
deep neural networks (DNNs). However, the hardware implementation of one widely used …
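The softmax function that such hardware architectures approximate can be written as a short reference implementation; this is the standard numerically stable form, not the paper's circuit design:

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax: shift by the max before exponentiating."""
    z = np.asarray(z, dtype=np.float64)
    e = np.exp(z - z.max())   # the shift prevents overflow in exp
    return e / e.sum()

p = softmax([2.0, 1.0, 0.1])
```

The exponential and the division are the costly operations in hardware, which is why dedicated architectures replace them with lookup tables or log-domain arithmetic.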

Accelerating transformer-based deep learning models on FPGAs using column balanced block pruning

H Peng, S Huang, T Geng, A Li, W Jiang… - … on Quality Electronic …, 2021 - ieeexplore.ieee.org
Although Transformer-based language representations achieve state-of-the-art accuracy on
various natural language processing (NLP) tasks, the large model size has been …

Accelerating neural network inference on FPGA-based platforms—A survey

R Wu, X Guo, J Du, J Li - Electronics, 2021 - mdpi.com
The breakthrough of deep learning has started a technological revolution in various areas
such as object identification, image/video recognition and semantic segmentation. Neural …