Review on panoramic imaging and its applications in scene understanding

S Gao, K Yang, H Shi, K Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the rapid development of high-speed communication and artificial intelligence
technologies, human perception of real-world scenes is no longer limited to the use of small …

Deep learning for fluid velocity field estimation: A review

C Yu, X Bi, Y Fan - Ocean Engineering, 2023 - Elsevier
Deep learning technique, has made tremendous progress in fluid mechanics in recent
years, because of its mighty feature extraction capacity from complicated and massive fluid …

Stable video diffusion: Scaling latent video diffusion models to large datasets

A Blattmann, T Dockhorn, S Kulal… - arXiv preprint arXiv …, 2023 - arxiv.org
We present Stable Video Diffusion-a latent video diffusion model for high-resolution, state-of-
the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained …

Pix2video: Video editing using image diffusion

D Ceylan, CHP Huang, NJ Mitra - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Image diffusion models, trained on massive image collections, have emerged as the most
versatile image generator model in terms of quality and diversity. They support inverting real …

Videocomposer: Compositional video synthesis with motion controllability

X Wang, H Yuan, S Zhang, D Chen… - Advances in …, 2024 - proceedings.neurips.cc
The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …

Iterative geometry encoding volume for stereo matching

G Xu, X Wang, X Ding, X Yang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Recurrent All-Pairs Field Transforms (RAFT) has shown great potentials in
matching tasks. However, all-pairs correlations lack non-local geometry knowledge and …

Dynibar: Neural dynamic image-based rendering

Z Li, Q Wang, F Cole, R Tucker… - Proceedings of the …, 2023 - openaccess.thecvf.com
We address the problem of synthesizing novel views from a monocular video depicting a
complex dynamic scene. State-of-the-art methods based on temporally varying Neural …

Drag your gan: Interactive point-based manipulation on the generative image manifold

X Pan, A Tewari, T Leimkühler, L Liu, A Meka… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Synthesizing visual content that meets users' needs often requires flexible and precise
controllability of the pose, shape, expression, and layout of the generated objects. Existing …

Tracking everything everywhere all at once

Q Wang, YY Chang, R Cai, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …

Suds: Scalable urban dynamic scenes

H Turki, JY Zhang, F Ferroni… - Proceedings of the …, 2023 - openaccess.thecvf.com
We extend neural radiance fields (NeRFs) to dynamic large-scale urban scenes. Prior work
tends to reconstruct single video clips of short durations (up to 10 seconds). Two reasons …