State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Multi3drefer: Grounding text description to multiple 3d objects

Y Zhang, ZM Gong, AX Chang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We introduce the task of localizing a flexible number of objects in real-world 3D scenes
using natural language descriptions. Existing 3D visual grounding tasks focus on localizing …

Partslip: Low-shot part segmentation for 3d point clouds via pretrained image-language models

M Liu, Y Zhu, H Cai, S Han, Z Ling… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generalizable 3D part segmentation is important but challenging in vision and robotics.
Training deep models via conventional supervised methods requires large-scale 3D …

Salad: Part-level latent diffusion for 3d shape generation and manipulation

J Koo, S Yoo, MH Nguyen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present a cascaded diffusion model based on a part-level implicit 3D representation. Our
model achieves state-of-the-art generation quality and also enables part-level shape editing …

Satr: Zero-shot semantic segmentation of 3d shapes

A Abdelreheem, I Skorokhodov… - Proceedings of the …, 2023 - openaccess.thecvf.com
We explore the task of zero-shot semantic segmentation of 3D shapes by using large-scale
off-the-shelf 2D im-age recognition models. Surprisingly, we find that modern zero-shot 2D …

ShapeTalk: A language dataset and framework for 3d shape edits and deformations

P Achlioptas, I Huang, M Sung… - Proceedings of the …, 2023 - openaccess.thecvf.com
Editing 3D geometry is a challenging task requiring specialized skills. In this work, we aim to
facilitate the task of editing the geometry of 3D models through the use of natural language …

Mesh2tex: Generating mesh textures from image queries

A Bokhovkin, S Tulsiani, A Dai - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Remarkable advances have been achieved recently in learning neural representations that
characterize object geometry, while generating textured objects suitable for downstream …

Difffacto: Controllable part-based 3d point cloud generation with cross diffusion

GK Nakayama, MA Uy, J Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
While the community of 3D point cloud generation has witnessed a big growth in recent
years, there still lacks an effective way to enable intuitive user control in the generation …

Text2scene: Text-driven indoor scene stylization with part-aware details

I Hwang, H Kim, YM Kim - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Abstract We propose Text2Scene, a method to automatically create realistic textures for
virtual scenes composed of multiple objects. Guided by a reference image and text …

Scanents3d: Exploiting phrase-to-3d-object correspondences for improved visio-linguistic models in 3d scenes

A Abdelreheem, K Olszewski, HY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com
The two popular datasets ScanRefer [20] and ReferIt3D [5] connect natural language to real-
world 3D scenes. In this paper, we curate a complementary dataset extending both the …