Object Recognition as Next Token Prediction

K Yue, BC Chen, J Geiping, H Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present an approach to pose object recognition as next token prediction. The idea is to
apply a language decoder that auto-regressively predicts the text tokens from image …