Assistive tagging: A survey of multimedia tagging with human-computer joint exploration

M Wang, B Ni, XS Hua, TS Chua - ACM computing surveys (CSUR), 2012 - dl.acm.org
Along with the explosive growth of multimedia data, automatic multimedia tagging has
attracted great interest of various research communities, such as computer vision …

Object class detection: A survey

X Zhang, YH Yang, Z Han, H Wang, C Gao - ACM Computing Surveys …, 2013 - dl.acm.org
Object class detection, also known as category-level object detection, has become one of
the most focused areas in computer vision in the new century. This article attempts to …

Learning visual features from large weakly supervised data

A Joulin, L Van Der Maaten, A Jabri… - Computer Vision–ECCV …, 2016 - Springer
Convolutional networks trained on large supervised datasets produce visual features which
form the basis for the state-of-the-art in many computer-vision problems. Further …

Webly supervised learning of convolutional networks

X Chen, A Gupta - … of the IEEE international conference on …, 2015 - openaccess.thecvf.com
We present an approach to utilize large amounts of web data for learning CNNs. Specifically
inspired by curriculum learning, we present a two-step approach for CNN training. First, we …

A multi-view embedding space for modeling internet images, tags, and their semantics

Y Gong, Q Ke, M Isard, S Lazebnik - International journal of computer …, 2014 - Springer
This paper investigates the problem of modeling Internet images and associated text or tags
for tasks such as image-to-image search, tag-to-image search, and image-to-tag search …

Visual-textual joint relevance learning for tag-based social image search

Y Gao, M Wang, ZJ Zha, J Shen, X Li… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Due to the popularity of social media websites, extensive research efforts have been
dedicated to tag-based social image search. Both visual information and tags have been …

Multi-label learning with incomplete class assignments

SS Bucak, R Jin, AK Jain - CVPR 2011, 2011 - ieeexplore.ieee.org
We consider a special type of multi-label learning where class assignments of training
examples are incomplete. As an example, an instance whose true class assignment is (c 1, c …

Uncertainty injection: A deep learning method for robust optimization

W Cui, W Yu - IEEE Transactions on Wireless Communications, 2023 - ieeexplore.ieee.org
This paper proposes a paradigm of uncertainty injection for training deep learning model to
solve robust optimization problems. The majority of existing studies on deep learning focus …

Trecvid semantic indexing of video: A 6-year retrospective

G Awad, CGM Snoek, AF Smeaton… - ITE Transactions on …, 2016 - jstage.jst.go.jp
Semantic indexing, or assigning semantic tags to video samples, is a key component for
content-based access to video documents and collections. The Semantic Indexing task has …

Local coordinate concept factorization for image representation

H Liu, Z Yang, J Yang, Z Wu, X Li - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
Learning sparse representation of high-dimensional data is a state-of-the-art method for
modeling data. Matrix factorization-based techniques, such as nonnegative matrix …