Q-vit: Accurate and fully quantized low-bit vision transformer

Y Li, S Xu, B Zhang, X Cao, P Gao… - Advances in neural …, 2022 - proceedings.neurips.cc
The large pre-trained vision transformers (ViTs) have demonstrated remarkable
performance on various visual tasks, but suffer from expensive computational and memory …

When object detection meets knowledge distillation: A survey

Z Li, P Xu, X Chang, L Yang, Y Zhang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Object detection (OD) is a crucial computer vision task that has seen the development of
many algorithms and models over the years. While the performance of current OD models …

Q-detr: An efficient low-bit quantized detection transformer

S Xu, Y Li, M Lin, P Gao, G Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent detection transformer (DETR) has advanced object detection, but its application
on resource-constrained devices requires massive computation and memory resources …

Resilient binary neural network

S Xu, Y Li, T Ma, M Lin, H Dong, B Zhang… - Proceedings of the …, 2023 - ojs.aaai.org
Binary neural networks (BNNs) have received ever-increasing popularity for their great
capability of reducing storage burden as well as quickening inference time. However, there …

DCP–NAS: Discrepant Child–Parent Neural Architecture Search for 1-bit CNNs

Y Li, S Xu, X Cao, L Zhuo, B Zhang, T Wang… - International Journal of …, 2023 - Springer
Neural architecture search (NAS) proves to be among the effective approaches for many
tasks by generating an application-adaptive neural architecture, which is still challenged by …

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection

J Li, M Lu, J Liu, Y Guo, Y Du, L Du… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Recently, the Bird's-Eye-View (BEV) representation has gained increasing attention in multi-
view 3D object detection, demonstrating promising applications in autonomous driving …

Semantic RGB-D Image Synthesis

S Li, R Li, J Gall - Proceedings of the IEEE/CVF International …, 2023 - openaccess.thecvf.com
Collecting diverse sets of training images for RGB-D semantic image segmentation is not
always possible. In particular, when robots need to operate in privacy-sensitive areas like …

Bi-ViT: Pushing the Limit of Vision Transformer Quantization

Y Li, S Xu, M Lin, X Cao, C Liu, X Sun… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Vision transformers (ViTs) quantization offers a promising prospect to facilitate deploying
large pre-trained networks on resource-limited devices. Fully-binarized ViTs (Bi-ViT) that …

Joint-Guided Distillation Binary Neural Network via Dynamic Channel-Wise Diversity Enhancement for Object Detection

Y Xie, X Hou, Y Guo, X Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Through truncating the weights and activations of a deep neural network, conventional
binary quantization imposes limitations on the representation capability of the network …

Ulit-BiDet: An Ultra-lightweight Object Detector for SAR Images Based on Binary Neural Networks

H Pu, Z Zhu, Q Hu, D Wang - IEEE Transactions on Geoscience …, 2024 - ieeexplore.ieee.org
Synthetic aperture radar (SAR) target detection has extensively utilized convolutional neural
networks (CNNs). Nonetheless, CNN-based methods often achieve favorable detection …