Advances in deep concealed scene understanding

DP Fan, GP Ji, P Xu, MM Cheng, C Sakaridis… - Visual Intelligence, 2023 - Springer
Concealed scene understanding (CSU) is a hot computer vision topic aiming to perceive
objects exhibiting camouflage. The current boom in terms of techniques and applications …

How good is Google Bard's visual understanding? An empirical study on open challenges

H Qin, GP Ji, S Khan, DP Fan, FS Khan, LV Gool - 2023 - Springer
Google's Bard has emerged as a formidable competitor to OpenAI's ChatGPT in the field of
conversational AI. Notably, Bard has recently been updated to handle visual inputs …

FoodMask: Real-time food instance counting, segmentation and recognition

HT Nguyen, Y Cao, CW Ngo, WK Chan - Pattern Recognition, 2024 - Elsevier
Food computing has long been studied and deployed to several applications.
Understanding a food image at the instance level, including recognition, counting and …

Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and Baseline

X Zhao, Y Pang, Z Chen, Q Yu… - Proceedings of the …, 2024 - openaccess.thecvf.com
We conduct a comprehensive study on a new task named power battery detection (PBD)
which aims to localize the dense cathode and anode plates endpoints from X-ray images to …

CoralSCOP: Segment any COral Image on this Planet

Z Zheng, H Liang, BS Hua, YH Wong… - Proceedings of the …, 2024 - openaccess.thecvf.com
Underwater visual understanding has recently gained increasing attention within the
computer vision community for studying and monitoring underwater ecosystems. Among …

A Fixed-Point Approach to Unified Prompt-Based Counting

W Lin, AB Chan - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Existing class-agnostic counting models typically rely on a single type of prompt, e.g., box
annotations. This paper aims to establish a comprehensive prompt-based counting …

Unlabeled scene adaptive crowd counting via meta-ensemble learning

C Ma, J Zeng, P Shao, A Qing, Y Wang - Transportation research part C …, 2024 - Elsevier
The objective of unlabeled scene adaptive crowd counting (USACC) is to adapt the crowd
counting model to a particular scene by utilizing only a handful of unlabeled images from …

A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Videos

CY Yang, HW Huang, Z Jiang, H Wang… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Dense object counting or crowd counting has come a long way thanks to the recent
development in the vision community. However, indiscernible object counting, which aims to …

Noised Autoencoders for Point Annotation Restoration in Object Counting

Y Zou, X Xiao, P Zhou, Z Sun, B Du, Y Xu - arXiv preprint arXiv …, 2023 - arxiv.org
Object counting is a field of growing importance in domains such as security surveillance,
urban planning, and biology. The annotation is usually provided in terms of 2D points …

HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation

Z Luo, Y Wu, Y Liu, Y Xiao, XP Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
The newly proposed Generalized Referring Expression Segmentation (GRES) amplifies the
formulation of classic RES by involving multiple/non-target scenarios. Recent approaches …