Video object segmentation and tracking: A survey

R Yao, G Lin, S Xia, J Zhao, Y Zhou - ACM Transactions on Intelligent …, 2020 - dl.acm.org
Object segmentation and object tracking are fundamental research areas in the computer
vision community. These two topics are difficult to handle some common challenges, such …

[HTML][HTML] A survey on GANs for computer vision: Recent research, analysis and taxonomy

G Iglesias, E Talavera, A Díaz-Álvarez - Computer Science Review, 2023 - Elsevier
In the last few years, there have been several revolutions in the field of deep learning,
mainly headlined by the large impact of Generative Adversarial Networks (GANs). GANs not …

Drag your gan: Interactive point-based manipulation on the generative image manifold

X Pan, A Tewari, T Leimkühler, L Liu, A Meka… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Synthesizing visual content that meets users' needs often requires flexible and precise
controllability of the pose, shape, expression, and layout of the generated objects. Existing …

Tracking everything everywhere all at once

Q Wang, YY Chang, R Cai, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d

Y Liao, J Xie, A Geiger - IEEE Transactions on Pattern Analysis …, 2022 - ieeexplore.ieee.org
For the last few decades, several major subfields of artificial intelligence including computer
vision, graphics, and robotics have progressed largely independently from each other …

Banmo: Building animatable 3d neural models from many casual videos

G Yang, M Vo, N Neverova… - Proceedings of the …, 2022 - openaccess.thecvf.com
Prior work for articulated 3D shape reconstruction often relies on specialized multi-view and
depth sensors or pre-built deformable 3D models. Such methods do not scale to diverse sets …

Image inpainting for irregular holes using partial convolutions

G Liu, FA Reda, KJ Shih, TC Wang… - Proceedings of the …, 2018 - openaccess.thecvf.com
Existing deep learning based image inpainting methods use a standard convolutional
network over the corrupted image, using convolutional filter responses conditioned on both …

Particle video revisited: Tracking through occlusions using point trajectories

AW Harley, Z Fang, K Fragkiadaki - European Conference on Computer …, 2022 - Springer
Tracking pixels in videos is typically studied as an optical flow estimation problem, where
every pixel is described with a displacement vector that locates it in the next frame. Even …

Df-net: Unsupervised joint learning of depth and flow using cross-task consistency

Y Zou, Z Luo, JB Huang - Proceedings of the European …, 2018 - openaccess.thecvf.com
We present an unsupervised learning framework for simultaneously training single-view
depth prediction and optical flow estimation models using unlabeled video sequences …