Abstract We present Ego-Exo4D a diverse large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …
G Yang, H Tang, M Ding, N Sebe… - Proceedings of the …, 2021 - openaccess.thecvf.com
While convolutional neural networks have shown a tremendous impact on various computer vision tasks, they generally demonstrate limitations in explicitly modeling long-range …
State-of-the-art methods in the image-to-image translation are capable of learning a mapping from a source domain to a target domain with unpaired image data. Though the …
What will the future be? We wonder! In this survey, we explore the gap between current research in egocentric vision and the ever-anticipated future, where wearable computing …
H Tang, PHS Torr, N Sebe - IEEE Transactions on Pattern …, 2022 - ieeexplore.ieee.org
We propose a novel model named Multi-Channel Attention Selection Generative Adversarial Network (SelectionGAN) for guided image-to-image translation, where we translate an input …
Y Li, H Liu, H Tang - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Multi-modal fusion is proven to be an effective method to improve the accuracy and robustness of speaker tracking, especially in complex scenarios. However, how to combine …
We investigate exocentric-to-egocentric cross-view translation, which aims to generate a first- person (egocentric) view of an actor based on a video recording that captures the actor from …
B Ren, H Tang, N Sebe - arXiv preprint arXiv:2110.10183, 2021 - arxiv.org
It is hard to generate an image at target view well for previous cross-view image translation methods that directly adopt a simple encoder-decoder or U-Net structure, especially for …
HY Lee, YH Li, TH Lee, MS Aslam - Sensors, 2023 - mdpi.com
Unsupervised image-to-image translation has received considerable attention due to the recent remarkable advancements in generative adversarial networks (GANs). In image-to …