H Zhu, MD Luo, R Wang, AH Zheng, R He - International Journal of …, 2021 - Springer
Audio-visual learning, aimed at exploiting the relationship between audio and visual modalities, has drawn considerable attention since deep learning started to be used …
A Swerdlow, R Xu, B Zhou - IEEE Robotics and Automation …, 2024 - ieeexplore.ieee.org
Bird's-Eye View (BEV) Perception has received increasing attention in recent years as it provides a concise and unified spatial representation across views and benefits a diverse …
Image synthesis is a process of converting the input text, sketch, or other sources, ie, another image or mask, into an image. It is an important problem in the computer vision field, where it …
In this paper, we propose an Omni-perception Pre-Trainer (OPT) for cross-modal understanding and generation, by jointly modeling visual, text and audio resources. OPT is …
M Soloveitchik, T Diskin, E Morin, A Wiesel - arXiv preprint arXiv …, 2021 - arxiv.org
We consider distance functions between conditional distributions. We focus on the Wasserstein metric and its Gaussian case known as the Frechet Inception Distance (FID) …
Y Zhou, N Shimada - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Text-to-image generation has attracted significant interest from researchers and practitioners in recent years due to its widespread and diverse applications across various industries …
CW Seo, A Ashtari, J Noh - ACM Transactions on Graphics (TOG), 2023 - dl.acm.org
Sketches reflect the drawing style of individual artists; therefore, it is important to consider their unique styles when extracting sketches from color images for various applications …
In the modern data-driven landscape, organizations are inundated with massive amounts of data, necessitating robust and scalable database solutions (Arzamasova et al., 2020). SQL …
With the increasing interest in various creative scenes such as social media, film production, and intelligence courses, people expect to be able to compile rich visual content according …