Bat: Behavior-aware human-like trajectory prediction for autonomous driving

H Liao, Z Li, H Shen, W Zeng, D Liao, G Li… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The ability to accurately predict the trajectory of surrounding vehicles is a critical hurdle to
overcome on the journey to fully autonomous vehicles. To address this challenge, we …

Sal-vit: Towards latency efficient private inference on vit using selective attention search with a learnable softmax approximation

Y Zhang, D Chen, S Kundu, C Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, private inference (PI) has addressed the rising concern over data and model
privacy in machine learning inference as a service. However, existing PI frameworks suffer …

CoPriv: network/protocol co-optimization for communication-efficient private inference

W Zeng, M Li, H Yang, W Lu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Deep neural network (DNN) inference based on secure 2-party computation (2PC) can offer
cryptographically-secure privacy protection but suffers from orders of magnitude latency …

East: Efficient and accurate secure transformer framework for inference

Y Ding, H Guo, Y Guan, W Liu, J Huo, Z Guan… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformer has been successfully used in practical applications, such as ChatGPT, due to
its powerful advantages. However, users' input is leaked to the model provider during the …

Deepreshape: Redesigning neural networks for efficient private inference

NK Jha, B Reagen - arXiv preprint arXiv:2304.10593, 2023 - arxiv.org
Prior work on Private Inference (PI)--inferences performed directly on encrypted input--has
focused on minimizing a network's ReLUs, which have been assumed to dominate PI …

RNA-ViT: Reduced-Dimension Approximate Normalized Attention Vision Transformers for Latency Efficient Private Inference

D Chen, Y Zhang, S Kundu, C Li… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
The concern over data and model privacy in machine learning inference as a service
(MLaaS) has led to the development of private inference (PI) techniques. However, existing …

Converting transformers to polynomial form for secure inference over homomorphic encryption

I Zimerman, M Baruch, N Drucker, G Ezov… - arXiv preprint arXiv …, 2023 - arxiv.org
Designing privacy-preserving deep learning models is a major challenge within the deep
learning community. Homomorphic Encryption (HE) has emerged as one of the most …

Secformer: Towards fast and accurate privacy-preserving inference for large language models

J Luo, Y Zhang, J Zhang, X Mu, H Wang, Y Yu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the growing use of large language models hosted on cloud platforms to offer inference
services, privacy concerns are escalating, especially concerning sensitive data like …

ASCEND: Accurate yet Efficient End-to-End Stochastic Computing Acceleration of Vision Transformer

T Xie, Y Hu, R Wei, M Li, Y Wang… - … Design, Automation & …, 2024 - ieeexplore.ieee.org
Stochastic computing (SC) has emerged as a promising computing paradigm for neural
acceleration. However, how to accelerate the state-of-the-art Vision Transformer (ViT) with …

DReP: Deep ReLU pruning for fast private inference

P Hu, L Sun, C Hu, L Dai, S Guo, M Yu - Journal of Systems Architecture, 2024 - Elsevier
With increasing concerns about privacy issues in deep learning, privacy-preserving neural
network inference has been receiving growing attention from the community, but the …