Deep generalized max pooling

V Christlein, L Spranger, M Seuret… - 2019 International …, 2019 - ieeexplore.ieee.org
Global pooling layers are an essential part of Convolutional Neural Networks (CNN). They
are used to aggregate activations of spatial locations to produce a fixed-size vector in …

Encoding CNN activations for writer recognition

V Christlein, A Maier - 2018 13th IAPR international workshop …, 2018 - ieeexplore.ieee.org
The encoding of local features is an essential part for writer identification and writer retrieval.
While CNN activations have already been used as local features in related works, the …

Multi-scale orderless pooling of deep convolutional activation features

Y Gong, L Wang, R Guo, S Lazebnik - … 6-12, 2014, Proceedings, Part VII …, 2014 - Springer
Deep convolutional neural networks (CNN) have shown their promise as a universal
representation for recognition. However, global CNN activations lack geometric invariance …

S3pool: Pooling with stochastic spatial sampling

S Zhai, H Wu, A Kumar, Y Cheng… - Proceedings of the …, 2017 - openaccess.thecvf.com
Feature pooling layers (eg, max pooling) in convolutional neural networks (CNNs) serve the
dual purpose of providing increasingly abstract representations as well as yielding …

Rank-based pooling for deep convolutional neural networks

Z Shi, Y Ye, Y Wu - Neural Networks, 2016 - Elsevier
Pooling is a key mechanism in deep convolutional neural networks (CNNs) which helps to
achieve translation invariance. Numerous studies, both empirically and theoretically, show …

CNN features off-the-shelf: an astounding baseline for recognition

A Sharif Razavian, H Azizpour, J Sullivan… - Proceedings of the …, 2014 - cv-foundation.org
Recent results indicate that the generic descriptors extracted from the convolutional neural
networks are very powerful. This paper adds to the mounting evidence that this is indeed the …

Multigrain: a unified image embedding for classes and instances

M Berman, H Jégou, A Vedaldi, I Kokkinos… - arXiv preprint arXiv …, 2019 - arxiv.org
MultiGrain is a network architecture producing compact vector representations that are
suited both for image classification and particular object retrieval. It builds on a standard …

Recombinator networks: Learning coarse-to-fine feature aggregation

S Honari, J Yosinski, P Vincent… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
Deep neural networks with alternating convolutional, max-pooling and decimation layers are
widely used in state of the art architectures for computer vision. Max-pooling purposefully …

A comprehensive study of imagenet pre-training for historical document image analysis

L Studer, M Alberti, V Pondenkandath… - 2019 International …, 2019 - ieeexplore.ieee.org
Automatic analysis of scanned historical documents comprises a wide range of image
analysis tasks, which are often challenging for machine learning due to a lack of human …

Deep transfer learning for art classification problems

M Sabatelli, M Kestemont… - Proceedings Of The …, 2018 - openaccess.thecvf.com
In this paper we investigate whether Deep Convolutional Neural Networks (DCNNs), which
have obtained state of the art results on the ImageNet challenge, are able to perform equally …