Design principles for lifelong learning AI accelerators

D Kudithipudi, A Daram, AM Zyarah, FT Zohora… - Nature …, 2023 - nature.com
Lifelong learning—an agent's ability to learn throughout its lifetime—is a hallmark of
biological learning systems and a central challenge for artificial intelligence (AI). The …

Ten lessons from three generations shaped google's tpuv4i: Industrial product

NP Jouppi, DH Yoon, M Ashcraft… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Google deployed several TPU generations since 2015, teaching us lessons that changed
our views: semi-conductor technology advances unequally; compiler compatibility trumps …

Spatten: Efficient sparse attention architecture with cascade token and head pruning

H Wang, Z Zhang, S Han - 2021 IEEE International Symposium …, 2021 - ieeexplore.ieee.org
The attention mechanism is becoming increasingly popular in Natural Language Processing
(NLP) applications, showing superior performance than convolutional and recurrent …

Bts: An accelerator for bootstrappable fully homomorphic encryption

S Kim, J Kim, MJ Kim, W Jung, J Kim, M Rhu… - Proceedings of the 49th …, 2022 - dl.acm.org
Homomorphic encryption (HE) enables the secure offloading of computations to the cloud by
providing computation on encrypted data (ciphertexts). HE is based on noisy encryption …

Silicon photonics for extreme scale systems

Y Shen, X Meng, Q Cheng, S Rumley… - Journal of Lightwave …, 2019 - opg.optica.org
High-performance systems are increasingly bottlenecked by the growing energy and
communications costs of interconnecting numerous compute and memory resources …

DNN+ NeuroSim V2. 0: An end-to-end benchmarking framework for compute-in-memory accelerators for on-chip training

X Peng, S Huang, H Jiang, A Lu… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
DNN+ NeuroSim is an integrated framework to benchmark compute-in-memory (CIM)
accelerators for deep neural networks, with hierarchical design options from device-level, to …

Ark: Fully homomorphic encryption accelerator with runtime data generation and inter-operation key reuse

J Kim, G Lee, S Kim, G Sohn, M Rhu… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Homomorphic Encryption (HE) is one of the most promising post-quantum cryptographic
schemes that enable privacy-preserving computation on servers. However, noise …

EnGN: A high-throughput and energy-efficient accelerator for large graph neural networks

S Liang, Y Wang, C Liu, L He… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Graph neural networks (GNNs) emerge as a powerful approach to process non-euclidean
data structures and have been proved powerful in various application domains such as …

Transpim: A memory-based acceleration via software-hardware co-design for transformer

M Zhou, W Xu, J Kang, T Rosing - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Transformer-based models are state-of-the-art for many machine learning (ML) tasks.
Executing Transformer usually requires a long execution time due to the large memory …

SHARP: A short-word hierarchical accelerator for robust and practical fully homomorphic encryption

J Kim, S Kim, J Choi, J Park, D Kim… - Proceedings of the 50th …, 2023 - dl.acm.org
Fully homomorphic encryption (FHE) is an emerging cryptographic technology that
guarantees the privacy of sensitive user data by enabling direct computations on encrypted …