H Li, J Zhu, X Jiang, X Zhu, H Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the remarkable success of foundation models, their task-specific fine-tuning paradigm makes them inconsistent with the goal of general perception modeling. The key to …
Biological intelligence systems of animals perceive the world by integrating information in different modalities and processing simultaneously for various tasks. In contrast, current …
We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world. Using a scalable data engine that incorporates …
W Su, X Zhu, C Tao, L Lu, B Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
To effectively exploit the potential of large-scale models, various pre-training strategies supported by massive data from different sources are proposed, including supervised pre …
To build an artificial neural network like the biological intelligence system, recent works have unified numerous tasks into a generalist model, which can process various tasks with shared …
X Pan, T Ye, D Han, S Song… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recent years have witnessed the fast development of large-scale pre-training frameworks that can extract multi-modal representations in a unified form and achieve promising …
We introduce a generic framework that reduces the computational cost of object detection while retaining accuracy for scenarios where objects with varied sizes appear in high …
W Wu, K Cao, C Li, C Qian… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Unsupervised image-to-image translation aims at learning a mapping between two visual domains. However, learning a translation across large geometry variations al-ways ends up …