K Song, S Zhang, T Wang - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
The development of autoregressive modeling (AM) in computer vision lags behind natural
language processing (NLP) in self-supervised pre-training. This is mainly caused by the …