Metaverse: Perspectives from graphics, interactions and visualization

Y Zhao, J Jiang, Y Chen, R Liu, Y Yang, X Xue, S Chen - Visual Informatics, 2022 - Elsevier
The metaverse is a visual world that blends the physical world and digital world. At present,
the development of the metaverse is still in the early stage, and there lacks a framework for …

Self-supervised learning for videos: A survey

MC Schiappa, YS Rawat, M Shah - ACM Computing Surveys, 2023 - dl.acm.org
The remarkable success of deep learning in various domains relies on the availability of
large-scale annotated datasets. However, obtaining annotations is expensive and requires …

Decomposing NeRF for editing via feature field distillation

S Kobayashi, E Matsumoto… - Advances in Neural …, 2022 - proceedings.neurips.cc
Emerging neural radiance fields (NeRF) are a promising scene representation for computer
graphics, enabling high-quality 3D reconstruction and novel view synthesis from image …

VoxFormer: Sparse voxel transformer for camera-based 3D semantic scene completion

Y Li, Z Yu, C Choy, C Xiao, JM Alvarez… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans can easily imagine the complete 3D geometry of occluded objects and scenes. This
appealing ability is vital for recognition and understanding. To enable such capability in AI …

Not all points are equal: Learning highly efficient point-based detectors for 3D LiDAR point clouds

Y Zhang, Q Hu, G Xu, Y Ma, J Wan… - Proceedings of the …, 2022 - openaccess.thecvf.com
We study the problem of efficient object detection of 3D LiDAR point clouds. To reduce the
memory and computational cost, existing point-based pipelines usually adopt task-agnostic …

Rethinking range view representation for LiDAR segmentation

L Kong, Y Liu, R Chen, Y Ma, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
LiDAR segmentation is crucial for autonomous driving perception. Recent trends favor point-
or voxel-based methods as they often yield better performance than the traditional range …

Robo3D: Towards robust and reliable 3D perception against corruptions

L Kong, Y Liu, X Li, R Chen, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The robustness of 3D perception systems under natural corruptions from environments and
sensors is pivotal for safety-critical applications. Existing large-scale 3D perception datasets …

CLIP2Scene: Towards label-efficient 3D scene understanding by CLIP

R Chen, Y Liu, L Kong, X Zhu, Y Ma… - Proceedings of the …, 2023 - openaccess.thecvf.com
Contrastive Language-Image Pre-training (CLIP) achieves promising results in 2D
zero-shot and few-shot learning. Despite the impressive performance in 2D, applying CLIP …

VRT: A video restoration transformer

J Liang, J Cao, Y Fan, K Zhang… - … on Image Processing, 2024 - ieeexplore.ieee.org
Video restoration aims to restore high-quality frames from low-quality frames. Different from
single image restoration, video restoration generally requires to utilize temporal information …

3D object detection for autonomous driving: A survey

R Qian, X Lai, X Li - Pattern Recognition, 2022 - Elsevier
Autonomous driving is regarded as one of the most promising remedies to shield human
beings from severe crashes. To this end, 3D object detection serves as the core basis of …