T Guo, H Wang, Y Ma, J Ji, X Sun - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Recent advancements in single-stage Panoptic Narrative Grounding (PNG) have demonstrated significant potential. These methods predict pixel-level masks by directly …
Z Qian, Y Ma, J Ji, X Sun - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Referring 3D instance segmentation is a challenging task aimed at accurately segmenting a target instance within a 3D scene based on a given referring expression. However, previous …
D Yang, J Ji, Y Ma, T Guo, H Wang, X Sun… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce SemiRES, a semi-supervised framework that effectively leverages a combination of labeled and unlabeled data to perform RES. A significant hurdle …
D Yang, R Dong, J Ji, Y Ma, H Wang, X Sun… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, diffusion models have increasingly demonstrated their capabilities in vision understanding. By leveraging prompt-based learning to construct sentences, these models …
Y Ma, J Ji, X Sun, Y Zhou, X Hong, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper explores a novel dynamic network for vision and language tasks, where the inferring structure is customized on the fly for different inputs. Most previous state-of-the-art …
Z Qian, Y Ma, Z Lin, J Ji, X Zheng, X Sun… - arXiv preprint arXiv …, 2024 - Springer
3D referring expression comprehension (3DREC) and segmentation (3DRES) have overlapping objectives, indicating their potential for collaboration. However, existing …