ImageReward: Learning and evaluating human preferences for text-to-image generation

J Xu, X Liu, Y Wu, Y Tong, Q Li… - Advances in …, 2024 - proceedings.neurips.cc
We present a comprehensive solution to learn and improve text-to-image models from
human preference feedback. To begin with, we build ImageReward---the first general …

Holistic evaluation of text-to-image models

T Lee, M Yasunaga, C Meng, Y Mai… - Advances in …, 2024 - proceedings.neurips.cc
The stunning qualitative improvement of text-to-image models has led to their widespread
attention and adoption. However, we lack a comprehensive quantitative understanding of …

VBench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video generation has witnessed significant advancements, yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

FETV: A benchmark for fine-grained evaluation of open-domain text-to-video generation

Y Liu, L Li, S Ren, R Gao, S Li… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recently, open-domain text-to-video (T2V) generation models have made remarkable
progress. However, the promising results are mainly shown by the qualitative cases of …

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

G Stein, J Cresswell, R Hosseinzadeh… - Advances in …, 2024 - proceedings.neurips.cc
We systematically study a wide variety of generative models spanning semantically-diverse
image datasets to understand and improve the feature extractors and metrics used to …

Investigating students' cognitive processes in generative AI-assisted digital multimodal composing and traditional writing

M Liu, LJ Zhang, C Biebricher - Computers & Education, 2024 - Elsevier
Recently, generative artificial intelligence (AI)-powered chatbots such as ChatGPT and Bing
Chat have garnered increasing attention on a global scale. Previous studies have focused …

Evaluating text-to-visual generation with image-to-text generation

Z Lin, D Pathak, B Li, J Li, X Xia, G Neubig… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite significant progress in generative AI, comprehensive evaluation remains
challenging because of the lack of effective metrics and standardized benchmarks. For …

Evaluating and Improving Compositional Text-to-Visual Generation

B Li, Z Lin, D Pathak, J Li, Y Fei, K Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
While text-to-visual models now produce photo-realistic images and videos, they struggle
with compositional text prompts involving attributes, relationships, and higher-order …

Rich human feedback for text-to-image generation

Y Liang, J He, G Li, P Li, A Klimovskiy… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen
have made significant progress in generating high-resolution images based on text …

DOCCI: Descriptions of Connected and Contrasting Images

Y Onoe, S Rane, Z Berger, Y Bitton, J Cho… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision-language datasets are vital for both text-to-image (T2I) and image-to-text (I2T)
research. However, current datasets lack descriptions with fine-grained detail that would …