Deep learning and multi-modal fusion for real-time multi-object tracking: Algorithms, challenges, datasets, and comparative study

X Wang, Z Sun, A Chehri, G Jeon, Y Song - Information Fusion, 2024 - Elsevier
Real-time multi-object tracking (MOT) is a complex task involving detecting and tracking
multiple objects. After the objects are detected, they are assigned markers, and their …

Pointpatchmix: Point cloud mixing with patch scoring

Y Wang, J Wang, J Li, Z Zhao, G Chen, A Liu… - Proceedings of the …, 2024 - ojs.aaai.org
Data augmentation is an effective regularization strategy for mitigating overfitting in deep
neural networks, and it plays a crucial role in 3D vision tasks, where the point cloud data is …

VideoSAM: Open-World Video Segmentation

P Guo, Z Zhao, J Gao, C Wu, T He, Z Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Video segmentation is essential for advancing robotics and autonomous driving, particularly
in open-world settings where continuous perception and object association across video …

Temporally Consistent Object-Centric Learning by Contrasting Slots

A Manasyan, M Seitzer, F Radovic, G Martius… - arXiv preprint arXiv …, 2024 - arxiv.org
Unsupervised object-centric learning from videos is a promising approach to extract
structured representations from large, unlabeled collections of videos. To support …

OpenSlot: Mixed Open-set Recognition with Object-centric Learning

X Yin, F Pan, G An, Y Huo, Z Xie, SE Yoon - arXiv preprint arXiv …, 2024 - arxiv.org
Existing open-set recognition (OSR) studies typically assume that each image contains only
one class label, and the unknown test set (negative) has a disjoint label space from the …

Object Tracking in a View: A Novel Perspective on Bridging the Gap to Biomedical Advancements

MS Fazli, S Quinn - arXiv preprint arXiv:2412.01119, 2024 - arxiv.org
Object tracking is a fundamental tool in modern innovation, with applications in defense
systems, autonomous vehicles, and biomedical research. It enables precise identification …

Unsupervised Dynamics Prediction with Object-Centric Kinematics

YJ Song, S Choi, J Kim, JH Kim, BT Zhang - arXiv preprint arXiv …, 2024 - arxiv.org
Human perception involves discerning complex multi-object scenes into time-static object
appearance (\ie, size, shape, color) and time-varying object motion (\ie, location, velocity …

Scene Understanding

W Liu, H Hao, H Wang, Z Zou, W Xing - Graph Neural Network Methods …, 2025 - Springer
A scene could be a natural area or a place of human-made where human being could act
within. Usually, the man-made scene is divided into indoor scene and outdoor scene. Since …

[PDF][PDF] Discovering and Using Structure in Autonomous Machine Learning

A Zadaianchuk - 2024 - research-collection.ethz.ch
The ability to autonomously understand complex environments and act in them is an
essential goal in artificial agents' development. State-of-the-art agents may excel in …

Improved multi object tracking with locality sensitive hashing

AJ Chemmanam, B Jose, A Moopan - Pattern Analysis and Applications, 2024 - Springer
Object tracking is one of the most advanced applications of computer vision algorithms.
While various tracking approaches have been previously developed, they often use many …