Slotdiffusion: Object-centric generative modeling with diffusion models

Z Wu, J Hu, W Lu, I Gilitschenski… - Advances in Neural …, 2023 - proceedings.neurips.cc
Object-centric learning aims to represent visual data with a set of object entities (aka slots),
providing structured representations that enable systematic generalization. Leveraging …

Disco-diff: Enhancing continuous diffusion models with discrete latents

Y Xu, G Corso, T Jaakkola, A Vahdat… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion models (DMs) have revolutionized generative learning. They utilize a diffusion
process to encode data into a simple Gaussian distribution. However, encoding a complex …

Diffusion of thoughts: Chain-of-thought reasoning in diffusion language models

J Ye, S Gong, L Chen, L Zheng, J Gao, H Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, diffusion models have garnered significant interest in the field of text processing
due to their many potential advantages compared to conventional autoregressive models. In …

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

L Chen, Z Wang, S Ren, L Li, H Zhao, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Building on the foundations of language modeling in natural language processing, Next
Token Prediction (NTP) has evolved into a versatile training objective for machine learning …

Trustworthy Large Models in Vision: A Survey

Z Guo, L Xu, J Liu - arXiv preprint arXiv:2311.09680, 2023 - arxiv.org
The rapid progress of Large Models (LMs) has recently revolutionized various fields of deep
learning with remarkable grades, ranging from Natural Language Processing (NLP) to …