Drive anywhere: Generalizable end-to-end autonomous driving with multi-modal foundation models

TH Wang, A Maalouf, W Xiao, Y Ban… - … on Robotics and …, 2024 - ieeexplore.ieee.org
As autonomous driving technology matures, end-to-end methodologies have emerged as a
leading strategy, promising seamless integration from perception to control via deep …

PTQ4SAM: Post-Training Quantization for Segment Anything

C Lv, H Chen, J Guo, Y Ding… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Segment Anything Model (SAM) has achieved impressive performance in many
computer vision tasks. However, as a large-scale model, the immense memory and …

Robohop: Segment-based topological map representation for open-world visual navigation

S Garg, K Rana, M Hosseinzadeh, L Mares… - arXiv preprint arXiv …, 2024 - arxiv.org
Mapping is crucial for spatial reasoning, planning, and robot navigation. Existing approaches
range from metric maps, which require precise geometry-based optimization, to purely topological …

Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks

A Quach, M Chahine, A Amini, R Hasani… - arXiv preprint arXiv …, 2024 - arxiv.org
Simulators are powerful tools for autonomous robot learning as they offer scalable data
generation, flexible design, and optimization of trajectories. However, transferring behavior …

Multishot Structured-Light 3-D Scanning for Surfaces in Challenging Motion

M Duan, Y Zheng, Y Jin, J Zheng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Challenging motion, resulting in serious motion artifacts, is a well-known problem in
structured-light (SL) 3-D scanning. Single-shot imaging or tracking interframe offsets …

Probing Multimodal LLMs as World Models for Driving

S Sreeram, TH Wang, A Maalouf, G Rosman… - arXiv preprint arXiv …, 2024 - arxiv.org
We provide a sober look at the application of Multimodal Large Language Models (MLLMs)
within the domain of autonomous driving and challenge/verify some common assumptions …

Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection

S Gao, P Zhang, T Yan, H Lu - arXiv preprint arXiv:2408.04326, 2024 - arxiv.org
Salient Object Detection (SOD) aims to identify and segment the most prominent objects in
images. Advanced SOD methods often utilize various Convolutional Neural Networks (CNN) …

Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos

D Pham, M Hansen, F Dhellemmens, J Krause… - arXiv preprint arXiv …, 2024 - arxiv.org
Easily accessible sensing platforms, such as drones with diverse onboard sensors, have greatly
expanded the study of animal behavior in natural environments. Yet, analyzing vast, unlabeled video data …

SAMS: One-Shot Learning for the Segment Anything Model Using Similar Images

F Yin, J Li, Y Wei, W Zhang, C Xu - 2024 International Joint …, 2024 - ieeexplore.ieee.org
The field of computer vision is currently transitioning from closed-set to open-set tasks.
Vision foundation models have already demonstrated success in open-set scenarios …

Track Anything Rapter (TAR)

TV Puthanveettil - arXiv preprint arXiv:2405.11655, 2024 - arxiv.org
Object tracking is a fundamental task in computer vision with broad practical applications
across various domains, including traffic monitoring, robotics, and autonomous vehicle …