Toward verifiable and reproducible human evaluation for text-to-image generation

J Xu, X Liu, Y Wu, Y Tong, Q Li… - Advances in …, 2024 - proceedings.neurips.cc

We present a comprehensive solution to learn and improve text-to-image models from
human preference feedback. To begin with, we build ImageReward---the first general …

被引用次数：171 相关文章所有 6 个版本

[PDF] neurips.cc

Holistic evaluation of text-to-image models

T Lee, M Yasunaga, C Meng, Y Mai… - Advances in …, 2024 - proceedings.neurips.cc

The stunning qualitative improvement of text-to-image models has led to their widespread
attention and adoption. However, we lack a comprehensive quantitative understanding of …

被引用次数：44 相关文章所有 6 个版本

[PDF] thecvf.com

Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

被引用次数：36 相关文章所有 4 个版本

[PDF] neurips.cc

Fetv: A benchmark for fine-grained evaluation of open-domain text-to-video generation

Y Liu, L Li, S Ren, R Gao, S Li… - Advances in Neural …, 2024 - proceedings.neurips.cc

Recently, open-domain text-to-video (T2V) generation models have made remarkable
progress. However, the promising results are mainly shown by the qualitative cases of …

被引用次数：17 相关文章所有 5 个版本

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

G Stein, J Cresswell, R Hosseinzadeh… - Advances in …, 2024 - proceedings.neurips.cc

We systematically study a wide variety of generative models spanning semantically-diverse
image datasets to understand and improve the feature extractors and metrics used to …

被引用次数：33 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] Investigating students' cognitive processes in generative AI-assisted digital multimodal composing and traditional writing

M Liu, LJ Zhang, C Biebricher - Computers & Education, 2024 - Elsevier

Recently, generative artificial intelligence (AI)-powered chatbots such as ChatGPT and Bing
Chat have garnered increasing attention on a global scale. Previous studies have focused …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Evaluating text-to-visual generation with image-to-text generation

Z Lin, D Pathak, B Li, J Li, X Xia, G Neubig… - arXiv preprint arXiv …, 2024 - arxiv.org

Despite significant progress in generative AI, comprehensive evaluation remains
challenging because of the lack of effective metrics and standardized benchmarks. For …

被引用次数：11 相关文章所有 2 个版本

[PDF] thecvf.com

Evaluating and Improving Compositional Text-to-Visual Generation

B Li, Z Lin, D Pathak, J Li, Y Fei, K Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com

While text-to-visual models now produce photo-realistic images and videos they struggle
with compositional text prompts involving attributes relationships and higher-order …

被引用次数：1 相关文章

[PDF] thecvf.com

Rich human feedback for text-to-image generation

Y Liang, J He, G Li, P Li, A Klimovskiy… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen
have made significant progress in generating high-resolution images based on text …

被引用次数：8 相关文章所有 3 个版本

[PDF] arxiv.org

DOCCI: Descriptions of Connected and Contrasting Images

Y Onoe, S Rane, Z Berger, Y Bitton, J Cho… - arXiv preprint arXiv …, 2024 - arxiv.org

Vision-language datasets are vital for both text-to-image (T2I) and image-to-text (I2T)
research. However, current datasets lack descriptions with fine-grained detail that would …

被引用次数：3 相关文章所有 2 个版本

高级搜索

QQ 群