- 学术资源搜索

Perceptual video quality assessment: A survey

X Min, H Duan, W Sun, Y Zhu, G Zhai - Science China Information …, 2024 - Springer

Perceptual video quality assessment plays a vital role in the field of video processing due to
the existence of quality degradations introduced in various stages of video signal …

被引用次数：74 相关文章所有 2 个版本

[PDF] thecvf.com

NTIRE 2024 challenge on short-form UGC video quality assessment: Methods and results

X Li, K Yuan, Y Pei, Y Lu, M Sun… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality
Assessment (S-UGC VQA) where various excellent solutions are submitted and evaluated …

被引用次数：22 相关文章所有 3 个版本

[PDF] thecvf.com

Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

被引用次数：184 相关文章所有 4 个版本

[PDF] thecvf.com

Videocrafter2: Overcoming data limitations for high-quality video diffusion models

H Chen, Y Zhang, X Cun, M Xia… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-video generation aims to produce a video based on a given prompt. Recently
several commercial video models have been able to generate plausible videos with minimal …

被引用次数：165 相关文章所有 3 个版本

[PDF] thecvf.com

Evalcrafter: Benchmarking and evaluating large video generation models

Y Liu, X Cun, X Liu, X Wang, Y Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

The vision and language generative models have been overgrown in recent years. For
video generation various open-sourced models and public-available services have been …

被引用次数：86 相关文章所有 3 个版本

[PDF] thecvf.com

Q-instruct: Improving low-level visual abilities for multi-modality foundation models

H Wu, Z Zhang, E Zhang, C Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com

Multi-modality large language models (MLLMs) as represented by GPT-4V have introduced
a paradigm shift for visual perception and understanding tasks that a variety of abilities can …

被引用次数：62 相关文章所有 4 个版本

[PDF] arxiv.org

Towards open-ended visual quality comparison

H Wu, H Zhu, Z Zhang, E Zhang, C Chen, L Liao… - … on Computer Vision, 2025 - Springer

Comparative settings (eg. pairwise choice, listwise ranking) have been adopted by a wide
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …

被引用次数：35 相关文章所有 2 个版本

[PDF] arxiv.org

Q-bench: A benchmark for general-purpose foundation models on low-level vision

H Wu, Z Zhang, E Zhang, C Chen, L Liao… - arXiv preprint arXiv …, 2023 - arxiv.org

The rapid evolution of Multi-modality Large Language Models (MLLMs) has catalyzed a shift
in computer vision from specialized models to general-purpose foundation models …

被引用次数：107 相关文章所有 5 个版本

[PDF] arxiv.org

Id-animator: Zero-shot identity-preserving human video generation

X He, Q Liu, S Qian, X Wang, T Hu, K Cao… - arXiv preprint arXiv …, 2024 - arxiv.org

Generating high-fidelity human video with specified identities has attracted significant
attention in the content generation community. However, existing techniques struggle to …

被引用次数：26 相关文章所有 2 个版本

[PDF] arxiv.org

Videoscore: Building automatic metrics to simulate fine-grained human feedback for video generation

X He, D Jiang, G Zhang, M Ku, A Soni, S Siu… - arXiv preprint arXiv …, 2024 - arxiv.org

The recent years have witnessed great advances in video generation. However, the
development of automatic video metrics is lagging significantly behind. None of the existing …

被引用次数：19 相关文章所有 3 个版本

高级搜索

QQ 群

Perceptual video quality assessment: A survey

NTIRE 2024 challenge on short-form UGC video quality assessment: Methods and results

Vbench: Comprehensive benchmark suite for video generative models

Videocrafter2: Overcoming data limitations for high-quality video diffusion models

Evalcrafter: Benchmarking and evaluating large video generation models

Q-instruct: Improving low-level visual abilities for multi-modality foundation models

Towards open-ended visual quality comparison

Q-bench: A benchmark for general-purpose foundation models on low-level vision

Id-animator: Zero-shot identity-preserving human video generation

Videoscore: Building automatic metrics to simulate fine-grained human feedback for video generation

引用