PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models

F Meng, W Shao, L Luo, Y Wang, Y Chen, Q Lu… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-image (T2I) models have made substantial progress in generating images from
textual prompts. However, they frequently fail to produce images consistent with physical …