VLAI: Exploration and Exploitation based on Visual-Language Aligned Information for Robotic Object Goal Navigation

H Luo, Y Zeng, L Yang, K Chen, Z Shen, F Lv - Image and Vision …, 2024 - Elsevier
Abstract Object Goal Navigation (ObjectNav) is the task that an agent need navigate to an
instance of a specific category in an unseen environment through visual observations within …

De-noising mask transformer for referring image segmentation

Y Wang, F Lei, B Wang, Q Zhang, X Zhen… - Image and Vision …, 2025 - Elsevier
Abstract Referring Image Segmentation (RIS) is a challenging computer vision task that
involves identifying and segmenting specific objects in an image based on a natural …

Depth-Aware Spatiotemporal Fusion for Advancing Dynamic Hand Gesture Recognition

B Chanda, H Nyeem - Available at SSRN 5011540 - papers.ssrn.com
This paper proposes a novel Depth-Aware Spatiotemporal Fusion (DASF) framework to
improve the accuracy and robustness of dynamic hand gesture recognition in human …