TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos

Y Wang, Z Wang, L Liu, K Daniilidis - European Conference on Computer …, 2025 - Springer
We propose TRAM, a two-stage method to reconstruct a human's global trajectory and
motion from in-the-wild videos. TRAM robustifies SLAM to recover the camera motion in the …

Simultaneous Localization and Affordance Prediction for Tasks in Egocentric Video

Z Chavis, HS Park, SJ Guy - arXiv preprint arXiv:2407.13856, 2024 - arxiv.org
Vision-Language Models (VLMs) have shown great success as foundational models for
downstream vision and natural language applications in a variety of domains. However …