Abstract Model pre-training is a cornerstone of modern visual recognition systems. Although fully supervised pre-training on datasets like ImageNet is still the de-facto standard, recent …
T Barrett, Q Chen, A Zhang - Proceedings of the 2023 ACM Conference …, 2023 - dl.acm.org
To investigate the well-observed racial disparities in computer vision systems that analyze images of humans, researchers have turned to skin tone as a more objective annotation …
Learned embeddings for products are an important building block for web-scale e- commerce recommendation systems. At Pinterest, we build a single set of product …
The infusion of generative artificial intelligence (AI), as exemplified by models such as ChatGPT and Bard is proving to be a revolutionary catalyst within the building and …
S Dahan, A Fawaz, LZJ Williams… - … on Medical Imaging …, 2022 - proceedings.mlr.press
The extension of convolutional neural networks (CNNs) to non-Euclidean geometries has led to multiple frameworks for studying manifolds. Many of those methods have shown …
Gait analysis is proven to be a reliable way to perform person identification without relying on subject cooperation. Walking is a biometric that does not significantly change in short …
Y Zhang, Q Sun, Y Zhou, Z He, Z Yin, K Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Large-scale datasets play a vital role in computer vision. But current datasets are annotated blindly without differentiation to samples, making the data collection inefficient and …
C Cianfarani, AN Bhagoji, V Sehwag… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Representation learning,\textit {ie} the generation of representations useful for downstream applications, is a task of fundamental importance that underlies much of the …
The advent of text-image models, most notably CLIP, has significantly transformed the landscape of information retrieval. These models enable the fusion of various modalities …