Efficient match kernel between sets of features for visual recognition

L Bo, C Sminchisescu - Advances in neural information …, 2009 - proceedings.neurips.cc
In visual recognition, the images are frequently modeled as sets of local features (bags). We
show that bag of words, a common method to handle such cases, can be viewed as a …

[PDF][PDF] A theoretical analysis of feature pooling in visual recognition

YL Boureau, J Ponce, Y LeCun - … of the 27th international conference on …, 2010 - di.ens.fr
Many modern visual recognition algorithms incorporate a step of spatial 'pooling', where the
outputs of several nearby feature detectors are combined into a local or global 'bag of …

Kernel descriptors for visual recognition

L Bo, X Ren, D Fox - Advances in neural information …, 2010 - proceedings.neurips.cc
The design of low-level image features is critical for computer vision algorithms. Orientation
histograms, such as those in SIFT~\cite {Lowe2004Distinctive} and HOG~\cite …

The pyramid match kernel: Discriminative classification with sets of image features

K Grauman, T Darrell - … on Computer Vision (ICCV'05) Volume …, 2005 - ieeexplore.ieee.org
Discriminative learning is challenging when examples are sets of features, and the sets vary
in cardinality and lack any sort of meaningful ordering. Kernel-based classification methods …

Improving the fisher kernel for large-scale image classification

F Perronnin, J Sánchez, T Mensink - … Crete, Greece, September 5-11, 2010 …, 2010 - Springer
The Fisher kernel (FK) is a generic framework which combines the benefits of generative
and discriminative approaches. In the context of image classification the FK was shown to …

Efficiently matching sets of features with random histograms

W Dong, Z Wang, M Charikar, K Li - Proceedings of the 16th ACM …, 2008 - dl.acm.org
As the commonly used representation of a feature-rich data object has evolved from a single
feature vector to a set of feature vectors, a key challenge in building a content-based search …

Hierarchical matching with side information for image classification

Q Chen, Z Song, Y Hua, Z Huang… - 2012 IEEE conference …, 2012 - ieeexplore.ieee.org
In this work, we introduce a hierarchical matching framework with so-called side information
for image classification based on bag-of-words representation. Each image is expressed as …

Image classification with the fisher vector: Theory and practice

J Sánchez, F Perronnin, T Mensink… - International journal of …, 2013 - Springer
A standard approach to describe an image for classification and retrieval purposes is to
extract a set of local patch descriptors, encode them into a high dimensional vector and pool …

Conformer: Local features coupling global representations for visual recognition

Z Peng, W Huang, S Gu, L Xie… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Within Convolutional Neural Network (CNN), the convolution operations are good
at extracting local features but experience difficulty to capture global representations. Within …

Higher-order occurrence pooling for bags-of-words: Visual concept detection

P Koniusz, F Yan, PH Gosselin… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
In object recognition, the Bag-of-Words model assumes: i) extraction of local descriptors
from images, ii) embedding the descriptors by a coder to a given visual vocabulary space …