Raft: Recurrent all-pairs field transforms for optical flow

S Gao, K Yang, H Shi, K Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

With the rapid development of high-speed communication and artificial intelligence
technologies, human perception of real-world scenes is no longer limited to the use of small …

被引用次数：79 相关文章所有 8 个版本

Deep learning for fluid velocity field estimation: A review

C Yu, X Bi, Y Fan - Ocean Engineering, 2023 - Elsevier

Deep learning technique, has made tremendous progress in fluid mechanics in recent
years, because of its mighty feature extraction capacity from complicated and massive fluid …

被引用次数：46 相关文章所有 2 个版本

[PDF] arxiv.org

Stable video diffusion: Scaling latent video diffusion models to large datasets

A Blattmann, T Dockhorn, S Kulal… - arXiv preprint arXiv …, 2023 - arxiv.org

We present Stable Video Diffusion-a latent video diffusion model for high-resolution, state-of-
the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained …

被引用次数：302 相关文章所有 2 个版本

[PDF] thecvf.com

Pix2video: Video editing using image diffusion

D Ceylan, CHP Huang, NJ Mitra - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Image diffusion models, trained on massive image collections, have emerged as the most
versatile image generator model in terms of quality and diversity. They support inverting real …

被引用次数：157 相关文章所有 6 个版本

Videocomposer: Compositional video synthesis with motion controllability

X Wang, H Yuan, S Zhang, D Chen… - Advances in …, 2024 - proceedings.neurips.cc

The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …

被引用次数：178 相关文章所有 6 个版本

[PDF] thecvf.com

Iterative geometry encoding volume for stereo matching

G Xu, X Wang, X Ding, X Yang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Abstract Recurrent All-Pairs Field Transforms (RAFT) has shown great potentials in
matching tasks. However, all-pairs correlations lack non-local geometry knowledge and …

被引用次数：131 相关文章所有 5 个版本

[PDF] thecvf.com

Dynibar: Neural dynamic image-based rendering

Z Li, Q Wang, F Cole, R Tucker… - Proceedings of the …, 2023 - openaccess.thecvf.com

We address the problem of synthesizing novel views from a monocular video depicting a
complex dynamic scene. State-of-the-art methods based on temporally varying Neural …

被引用次数：136 相关文章所有 9 个版本

[PDF] acm.org Full View

Drag your gan: Interactive point-based manipulation on the generative image manifold

X Pan, A Tewari, T Leimkühler, L Liu, A Meka… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org

Synthesizing visual content that meets users' needs often requires flexible and precise
controllability of the pose, shape, expression, and layout of the generated objects. Existing …

被引用次数：161 相关文章所有 7 个版本

[PDF] thecvf.com

Tracking everything everywhere all at once

Q Wang, YY Chang, R Cai, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …

被引用次数：94 相关文章所有 5 个版本

[PDF] thecvf.com

Suds: Scalable urban dynamic scenes

H Turki, JY Zhang, F Ferroni… - Proceedings of the …, 2023 - openaccess.thecvf.com

We extend neural radiance fields (NeRFs) to dynamic large-scale urban scenes. Prior work
tends to reconstruct single video clips of short durations (up to 10 seconds). Two reasons …

被引用次数：83 相关文章所有 6 个版本

高级搜索

QQ 群