查看文章

arxiv.org 中的 [PDF]

Class Feature Pyramids for Video Explanation

作者

Alexandros Stergiou, Georgios Kapidis, Grigorios Kalliatakis, Christos Chrysoulas, Ronald Poppe, Veltkamp

发表日期

2019/9/18

研讨会论文

2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)

简介

Deep convolutional networks are widely used in video action recognition. 3D convolutions are one prominent approach to deal with the additional time dimension. While 3D convolutions typically lead to higher accuracies, the inner workings of the trained models are more difficult to interpret. We focus on creating human-understandable visual explanations that represent the hierarchical parts of spatio-temporal networks. We introduce Class Feature Pyramids, a method that traverses the entire network structure and incrementally discovers kernels at different network depths that are informative for a specific class. Our method does not depend on the network's architecture or the type of 3D convolutions, supporting grouped and depth-wise convolutions, convolutions in fibers, and convolutions in branches. We demonstrate the method on six state-of-the-art 3D convolution neural networks (CNNs) on three action …

引用总数

被引用次数：16

20202021202220233 7 2 4

学术搜索中的文章

Class feature pyramids for video explanation

A Stergiou, G Kapidis, G Kalliatakis, C Chrysoulas… - 2019 IEEE/CVF International Conference on Computer …, 2019

被引用次数：16 相关文章所有 10 个版本