Unifying 3D vision-language understanding via promptable queries

Z Zhu, Z Zhang, X Ma, X Niu, Y Chen, B Jia… - … on Computer Vision, 2025 - Springer
A unified model for 3D vision-language (3D-VL) understanding is expected to take various
scene representations and perform a wide range of tasks in a 3D scene. However, a …

Improved Mask R-CNN multi-target detection and segmentation for autonomous driving in complex scenes

S Fang, B Zhang, J Hu - Sensors, 2023 - mdpi.com
Vision-based target detection and segmentation has been an important research content for
environment perception in autonomous driving, but the mainstream target detection and …

Review of deep learning approaches in solving rock fragmentation problems

MV Ronkin, EN Akimova, VE Misilov - 2023 - elar.urfu.ru
One of the most significant challenges of the mining industry is resource yield estimation
from visual data. An example would be identification of the rock chunk distribution …

FoodMask: Real-time food instance counting, segmentation and recognition

HT Nguyen, Y Cao, CW Ngo, WK Chan - Pattern Recognition, 2024 - Elsevier
Food computing has long been studied and deployed to several applications.
Understanding a food image at the instance level, including recognition, counting and …

KepSalinst: Using peripheral points to delineate salient instances

J Chen, R Cong, HHS Ip… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Salient instance segmentation (SIS) is an emerging field that evolves from salient object
detection (SOD), aiming at identifying individual salient instances using segmentation maps …

DFormer: Diffusion-guided transformer for universal image segmentation

H Wang, J Cao, RM Anwer, J Xie, FS Khan… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper introduces an approach, named DFormer, for universal image segmentation. The
proposed DFormer views universal image segmentation task as a denoising process using …

Extreme Point Supervised Instance Segmentation

H Lee, S Hwang, S Kwak - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
This paper introduces a novel approach to learning instance segmentation using extreme
points, i.e., the topmost, leftmost, bottommost, and rightmost points of each object. These points …

LayoutFormer: Hierarchical Text Detection Towards Scene Text Understanding

M Liang, JW Ma, X Zhu, J Qin… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Existing scene text detectors generally focus on accurately detecting single-level (i.e., word-level,
line-level, or paragraph-level) text entities without exploring the relationships among …

MobileInst: Video instance segmentation on the mobile

R Zhang, T Cheng, S Yang, H Jiang, S Zhang… - Proceedings of the …, 2024 - ojs.aaai.org
Video instance segmentation on mobile devices is an important yet very challenging edge AI
problem. It mainly suffers from (1) heavy computation and memory costs for frame-by-frame …

Real-time evaluation of the blending uniformity of industrially produced gravelly soil based on Cond-YOLOv8-seg

Y Hu, J Wang, X Wang, Y Sun, H Yu, J Zhang - Journal of Industrial …, 2024 - Elsevier
Industrial production of gravelly soil has been applied at construction sites in earth-rock
engineering, which adopts automatic blending and belt-conveyer systems for gravelly soil …