A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

A survey on generative diffusion models

H Cao, C Tan, Z Gao, Y Xu, G Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep generative models have unlocked another profound realm of human creativity. By
capturing and generalizing patterns within data, we have entered the epoch of all …

A survey on generative ai and llm for video generation, understanding, and streaming

P Zhou, L Wang, Z Liu, Y Hao, P Hui, S Tarkoma… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper offers an insightful examination of how currently top-trending AI technologies, ie,
generative artificial intelligence (Generative AI) and large language models (LLMs), are …

Large motion model for unified multi-modal motion generation

M Zhang, D Jin, C Gu, F Hong, Z Cai, J Huang… - … on Computer Vision, 2025 - Springer
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …

Dae-talker: High fidelity speech-driven talking face generation with diffusion autoencoder

C Du, Q Chen, T He, X Tan, X Chen, K Yu… - Proceedings of the 31st …, 2023 - dl.acm.org
While recent research has made significant progress in speech-driven talking face
generation, the quality of the generated video still lags behind that of real recordings. One …

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

Motion-conditioned diffusion model for controllable video synthesis

TS Chen, CH Lin, HY Tseng, TY Lin… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advancements in diffusion models have greatly improved the quality and diversity of
synthesized content. To harness the expressive power of diffusion models, researchers have …

Face generation and editing with stylegan: A survey

A Melnik, M Miasayedzenkau… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Our goal with this survey is to provide an overview of the state of the art deep learning
methods for face generation and editing using StyleGAN. The survey covers the evolution of …

Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

KC Shum, J Kim, BS Hua… - Proceedings of the …, 2024 - openaccess.thecvf.com
Neural radiance field (NeRF) is an emerging technique for 3D scene reconstruction and
modeling. However current NeRF-based methods are limited in the capabilities of adding or …

CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation

H Shim, C Kim, E Yang - European Conference on Computer Vision, 2025 - Springer
Abstract 3D point clouds captured from real-world sensors frequently encompass noisy
points due to various obstacles, such as occlusion, limited resolution, and variations in …