Searching long egocentric videos with natural language queries (NLQ) has compelling applications in augmented reality and robotics, where a fluid index into everything that a …
The goal in episodic memory (EM) is to search a long egocentric video to answer a natural language query (eg,“where did I leave my purse?”). Existing EM methods exhaustively …
First-person video highlights a camera-wearer's activities in the context of their persistent environment. However, current video understanding approaches reason over visual features …
In this report, we present our champion solution for Ego4D Natural Language Queries (NLQ) Challenge in CVPR 2023. Essentially, to accurately ground in a video, an effective …
This report presents ReLER submission to two tracks in the Ego4D Episodic Memory Benchmark in CVPR 2023, including Natural Language Queries and Moment Queries. This …
Y Feng, H Zhang, Y Xie, Z Li, M Liu, L Nie - arXiv preprint arXiv …, 2024 - arxiv.org
In this report, we present our approach for the Natural Language Query track and Goal Step track of the Ego4D Episodic Memory Benchmark at CVPR 2024. Both challenges require the …
P Bruni, A Falcon, P Radeva - International Conference on Image Analysis …, 2023 - Springer
Episodic memory involves the ability to recall specific events, experiences, and locations from one's past. Humans use this ability to understand the context and significance of past …