Diffusion video autoencoders: Toward temporally consistent face video editing via disentangled...

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Cited by 76 · Related articles · All 3 versions

[PDF] arxiv.org

A survey on generative diffusion models

H Cao, C Tan, Z Gao, Y Xu, G Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Deep generative models have unlocked another profound realm of human creativity. By
capturing and generalizing patterns within data, we have entered the epoch of all …

Cited by 379 · Related articles · All 5 versions

[PDF] arxiv.org

A survey on generative ai and llm for video generation, understanding, and streaming

P Zhou, L Wang, Z Liu, Y Hao, P Hui, S Tarkoma… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper offers an insightful examination of how currently top-trending AI technologies, i.e.,
generative artificial intelligence (Generative AI) and large language models (LLMs), are …

Cited by 26 · Related articles · All 8 versions

[PDF] arxiv.org

Large motion model for unified multi-modal motion generation

M Zhang, D Jin, C Gu, F Hong, Z Cai, J Huang… - … on Computer Vision, 2025 - Springer

Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …

Cited by 13 · Related articles · All 2 versions

[PDF] arxiv.org

Dae-talker: High fidelity speech-driven talking face generation with diffusion autoencoder

C Du, Q Chen, T He, X Tan, X Chen, K Yu… - Proceedings of the 31st …, 2023 - dl.acm.org

While recent research has made significant progress in speech-driven talking face
generation, the quality of the generated video still lags behind that of real recordings. One …

Cited by 41 · Related articles · All 4 versions

[PDF] arxiv.org

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org

Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

Cited by 24 · Related articles · All 2 versions

[PDF] arxiv.org

Motion-conditioned diffusion model for controllable video synthesis

TS Chen, CH Lin, HY Tseng, TY Lin… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent advancements in diffusion models have greatly improved the quality and diversity of
synthesized content. To harness the expressive power of diffusion models, researchers have …

Cited by 55 · Related articles · All 2 versions

[PDF] ieee.org

Face generation and editing with stylegan: A survey

A Melnik, M Miasayedzenkau… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Our goal with this survey is to provide an overview of the state of the art deep learning
methods for face generation and editing using StyleGAN. The survey covers the evolution of …

Cited by 33 · Related articles · All 8 versions

[PDF] thecvf.com

Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

KC Shum, J Kim, BS Hua… - Proceedings of the …, 2024 - openaccess.thecvf.com

Neural radiance field (NeRF) is an emerging technique for 3D scene reconstruction and
modeling. However current NeRF-based methods are limited in the capabilities of adding or …

Cited by 6 · Related articles · All 3 versions

[PDF] arxiv.org

CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation

H Shim, C Kim, E Yang - European Conference on Computer Vision, 2025 - Springer

3D point clouds captured from real-world sensors frequently encompass noisy
points due to various obstacles, such as occlusion, limited resolution, and variations in …

Cited by 1 · Related articles · All 6 versions
