Affordance Perception by a Knowledge-Guided Vision-Language Model with Efficient Error Correction

G Burghouts, M Schaaphok, M van Bekkum… - arXiv preprint arXiv …, 2024 - arxiv.org
Mobile robot platforms will increasingly be tasked with activities that involve grasping and
manipulating objects in open world environments. Affordance understanding provides a …

LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision

J Huang, Z Li, M Naik, SN Lim - arXiv preprint arXiv:2304.07647, 2023 - arxiv.org
We propose LASER, a neuro-symbolic approach to learn semantic video representations
that capture rich spatial and temporal properties in video data by leveraging high-level logic …