Adanat: Exploring adaptive policy for token-based image generation

Z Ni, Y Wang, R Zhou, R Lu, J Guo, J Hu, Z Liu… - … on Computer Vision, 2024 - Springer
Recent studies have demonstrated the effectiveness of token-based methods for visual
content generation. As a representative work, non-autoregressive Transformers (NATs) are …

Powerful and flexible: Personalized text-to-image generation via reinforcement learning

F Wei, W Zeng, Z Li, D Yin, L Duan, W Li - European Conference on …, 2024 - Springer
Personalized text-to-image models allow users to generate varied styles of images
(specified with a sentence) for an object (specified with a set of reference images). While …

Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward Modeling and Reinforcement Learning Fine-tuning

Z Huang, X Weng, M Igl, Y Chen, Y Cao… - arXiv preprint arXiv …, 2024 - arxiv.org
Autonomous driving necessitates the ability to reason about future interactions between
traffic agents and to make informed evaluations for planning. This paper introduces the\textit …

Towards Reliable Advertising Image Generation Using Human Feedback

Z Du, W Feng, H Wang, Y Li, J Wang, J Li… - … on Computer Vision, 2024 - Springer
In the e-commerce realm, compelling advertising images are pivotal for attracting customer
attention. While generative models automate image generation, they often produce …

Improving dynamic object interactions in text-to-video generation with ai feedback

H Furuta, H Zen, D Schuurmans, A Faust… - arXiv preprint arXiv …, 2024 - arxiv.org
Large text-to-video models hold immense potential for a wide range of downstream
applications. However, these models struggle to accurately depict dynamic object …

Auditing and instructing text-to-image generation models on fairness

F Friedrich, M Brack, L Struppek, D Hintersdorf… - AI and Ethics, 2024 - Springer
Generative AI models have recently achieved astonishing results in quality and are
consequently employed in a fast-growing number of applications. However, since they are …